Hi,
thanks for this great module..
I've done some very light hacking (against version 0.14) to Lingua::Treebank to make it produce some tagged output from a given treebank
change to get-words
-tags print POS TAGS in Bikel's format i.e "(WORD (POS))"
-tags_only print POS TAGS only in usual format (POS1 .. POSN)
that would be great to produce as well some other format (such as chiang's one (chiang 2001) (POS WORD) ..(POSN WORDN)
and to make it produce perfectaggging mode for any parser
One way to simulate perfectagging is to prefix the words by its tag so
(POS WORD) becomes (POS POS_WORD)
another change I've made is to prefix all the script by TB in order to be able to find them moreeasy using tab-completion
here is a link to the modified version I've made
http://pauillac.inria.fr/~seddah/Lingua-Treebank-modified.tgz
Feel free to include the change if you want.
Cheers,
Djame (djame.seddah@free.fr)