various software

The following software may be useful to you. It is certainly not documented or tested as well as it should be, but that's academic software for you. All code is released under the GPLv2 license. Almost all code is in Python.

  • parsedyff: Visualize the differences between two treebank parse trees via graphviz
  • Parsing: Python module with parsing related functions (running/training the Charniak parser, tree reading, evaluation)
  • PyInputTree: Python interface to the InputTree structure from the Charniak parser via SWIG. This lets you traverse and view Treebank-style trees.
  • waterworks: My Python utility library (everyone else has one...)

Available by request:

  • Hogwash: Metaclustering system (only works with the Brown CS setup, but could be modified to work with other systems). I also use this to keep track of results from my experiments, so it is a basic persistence mechanism and results database. This code is probably not useful for those not familiar with it, unfortunately and the system has a steep learning curve.