Jesus Vilares' NLP & IR bookmarks





GENERAL PURPOSE RESOURCES (mainly)



ACL SIGLEX (Special Interest Group on the Lexicon)



BNC: English Language Corpora and Corpus resources



TREC (Text REtrieval Conference)


Signals, Speech, & Language Lab at the Univ. of Washington: Multilingual NLP Resources
 


Stanford NLP Research Group
*Tools: Taggers, Parsers, NER, NP chunking, Language models, Concordances, Summarization, Other
*Corpora: Large collections, Particular languages, Treebanks, Discourse, WSD, Literature, Acquisition
*SGML/XML
*Dictionaries
*Lexical/morphological resources
*Courses, Syllabi, and other Educational Resources
*Mailing lists
*Other stuff on the Web: General, IR, IE/Wrappers, People, Societies



ISSCO Research Group


Kenji Kita's personal page
 


David Lee's Bookmarks for Corpus-based Linguists



Mary D. Taffet's personal page: WWW Sites for Linguistics, Natural Language Processing, and Data Mining



Telcordia Latent Semantic Indexing (LSI)



Latent Semantic Analysis (LSA) @ CU Boulder


LSI - Latent Semantic Indexing Web Site






MULTILINGUAL RESOURCES (mainly)




Multext tools
 


PLUG Word Aligner - PWA Uplug corpus tools



OPUS - an open source parallel corpus



Aligned Hansards of the 36th Parliament of Canada (English-French)



University of Maryland Parallel Corpus Project


Oslo Multilingual Corpus (OMC) site



Parallel Corpora in Uppsala



Non-English, Parallel & Multilingual Corpora site



Project Gutenberg Free eBook Library


Free Online Dictionaries



Multlingva Tradukvortaro - Multilingual Translation Dictionary
 


Wordlists



The Bible Tool
 


BibleGateway.com
 


ARTFL Project: Multi-Lingual Bibles


Dept. of Linguistics at Chulalongkorn University (Thailand): Corpus resources


Computing Research Lab (CRL) at New Mexico State University


Franz Josef Och's personal page


Rada Mihalcea's personal page


Pascale Fung's personal page


Pamela Forner's personal page



Emily M. Bender's personal page



Manuel Barbera's Reference Guide to Corpora and Corpus-based Computational Linguistics Resources



CLEF (Cross Language Evaluation Forum)



NTCIR (NII Test Collection for IR Systems)



ARCADE





Last update: 26/11/2005