This english corpus is divided in texts for training and testing. For each genre there are respectively 20 HTML files and their tagged versions.
This english corpus is divided in texts for training and testing. For each genre there are respectively 20 HTML files and their tagged versions.