[Sigwac] Cleaneval data request
Jérôme Thièvre
jthievre at gmail.com
Tue Jul 6 12:10:27 CEST 2010
Hello,
I know that the clean competition is over, but I would like to test my
boilerplate removal algorithm against the cleaneval evaluation datasest to
compare with state of the art solutions.
We are building a fulltext indexing process and boilerplate removal is one
step of this process.
Sincerely, Jérôme
Name(s): THIEVRE Jérôme
Affiliation: Institut National de l'Audiovisuel
Country you work in: France
Contact email: jerome.thievre at gmail.com
Short name for participating system: INA html cleaner
Student yes/no : no
Participating for
Chinese: (yes/no) : no
English: (yes/no) : yes
More information about the Sigwac
mailing list