Posted on 2008-09-26 00:27:10-07 by djame
Some suggestions
Hi,
Thanks for this great module.

I've done some very light hacking (against version 0.14) on Lingua::Treebank so that it can produce tagged output from a given treebank.

Changes to get-words:
-tags       print POS tags in Bikel's format, i.e. "(WORD (POS))"
-tags_only  print POS tags only, in the usual format (POS1 .. POSN)
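
To make the two output formats concrete, here is a rough sketch in Python rather than the actual change to get-words. It assumes Penn-style bracketed input and uses a plain regex instead of the Lingua::Treebank API; the names tagged_words, bikel_format and tags_only are made up for illustration, and the post does not say whether the items should be wrapped in an extra outer bracket, so they are simply joined with spaces.

```python
import re

# A preterminal "(POS WORD)": a bracket holding exactly two
# whitespace-separated tokens, neither containing a parenthesis.
# Assumption: Penn-style bracketing; empty elements such as
# (-NONE- *T*-1) would be matched too and are not filtered here.
PRETERMINAL = re.compile(r"\(([^\s()]+)\s+([^\s()]+)\)")

def tagged_words(tree):
    """Return the (pos, word) pairs of a bracketed tree, left to right."""
    return PRETERMINAL.findall(tree)

def bikel_format(pairs):
    """Render each pair as "(WORD (POS))", as described for -tags."""
    return " ".join(f"({word} ({pos}))" for pos, word in pairs)

def tags_only(pairs):
    """Render the tags alone as "(POS1 .. POSN)", as described for -tags_only."""
    return "(" + " ".join(pos for pos, _ in pairs) + ")"

tree = "(S (NP (DT The) (NN cat)) (VP (VBZ sleeps)))"
pairs = tagged_words(tree)
print(bikel_format(pairs))  # (The (DT)) (cat (NN)) (sleeps (VBZ))
print(tags_only(pairs))     # (DT NN VBZ)
```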

It would also be great to produce other formats, such as Chiang's (Chiang 2001): (POS1 WORD1) .. (POSN WORDN),
and to offer a perfect-tagging mode that works with any parser.
One way to simulate perfect tagging is to prefix each word with its tag, so that
(POS WORD) becomes (POS POS_WORD)
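
The prefixing trick can be sketched in a few lines of Python. This is only an illustration under the same assumption of Penn-style bracketed input; perfect_tag is a hypothetical helper name and not part of Lingua::Treebank or of the modified scripts.

```python
import re

# A preterminal "(POS WORD)": exactly two tokens inside one bracket,
# neither containing a parenthesis (assumption: Penn-style bracketing).
PRETERMINAL = re.compile(r"\(([^\s()]+)\s+([^\s()]+)\)")

def perfect_tag(tree):
    """Rewrite every (POS WORD) as (POS POS_WORD), leaving the rest of the tree alone."""
    return PRETERMINAL.sub(r"(\1 \1_\2)", tree)

print(perfect_tag("(S (NP (DT The) (NN cat)) (VP (VBZ sleeps)))"))
# (S (NP (DT DT_The) (NN NN_cat)) (VP (VBZ VBZ_sleeps)))
```

Since every word now carries its gold tag in its surface form, a parser trained and tested on such trees effectively sees the correct tags, which is the point of the simulation.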

Another change I've made is to prefix all the scripts with TB so that they are easier to find using tab completion.
Here is a link to the modified version:
http://pauillac.inria.fr/~seddah/Lingua-Treebank-modified.tgz

Feel free to include the changes if you want.


Cheers,
Djame (djame.seddah@free.fr)