ODIN

The Online Database of Interlinear Text

 

Publications and other papers related to ODIN:

William Lewis and Fei Xia, Developing ODIN: A Multilingual Repository of Annotated Language Data for Hundreds of the World's Languages , in Literary and Linguistic Computing, Oxford University Press, September 2010.http://llc.oxfordjournals.org/cgi/reprint/fqq006?ijkey=ftq3CprBBTF1zxU&keytype=ref

Ryan Georgi, Fei Xia, and William Lewis, Comparing Language Similarity Across Genetic and Typologically-Based Groupings, in Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), International Conference on Computational Linguistics, August 2010. http://www.aclweb.org/anthology-new/C/C10/C10-1044.pdf

Fei Xia, Carrie Lewis, and William Lewis, The Problems of Language Identification within Hugely Multilingual Data Sets, in Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), European Language Resources Association, May 2010.http://www.lrec-conf.org/proceedings/lrec2010/pdf/921_Paper.pdf

Fei Xia, Carrie Lewis, and William Lewis, Language ID for a Thousand Languages, in eLanguage, LSA Annual Meeting Extended Abstracts, Linguistics Society of America, January 2010.http://elanguage.net/journals/index.php/lsameeting/article/viewFile/506/601

Xia, F., Lewis, W. D., and Poon, H. (2009), ‘Language ID in the Context of Harvesting Language Data off the Web’, in Proceedings of The 12th Conference of the European Chapter of the Association of Computational Linguistics (EACL), Athens, Greece, March 2009. http://faculty.washington.edu/wlewis2/papers/EACL-XLP-2009.pdf

Lewis, W. D. & Xia, F. (2009), ‘Parsing, Projecting & Prototypes: Repurposing Linguistic Data on the Web’, in Proceedings of The 12th Conference of the European Chapter of the Association of Computational Linguistics (EACL), Athens, Greece, March 2009. http://faculty.washington.edu/wlewis2/papers/Lewis-Xia-EACL-2009.pdf

Xia, F. & Lewis, W. D. (2009), ‘Applying NLP Technologies to the Collection and Enrichment of Language Data on the Web to Aid Linguistic Research’, in Proceedings of The 12th Conference of the European Chapter of the Association of Computational Linguistics (EACL), Athens, Greece, March 2009. http://faculty.washington.edu/wlewis2/papers/xia-lewis-eacl-2009.pdf

Lewis, W. D. & Xia, F. (2008), ‘Automatically Identifying Computationally Relevant Typological Features’, to appear in Proceedings of The Third International Joint Conference on Natural Language Processing (IJCNLP). Hyderabad, January 2008. http://faculty.washington.edu/wlewis2/papers/LewisXia-ijcnlp08t-06.pdf

Xia, F. & Lewis, W. D. (2008), ‘Repurposing Theoretical Linguistic Data for Tool Development and Search’, to appear in Proceedings of The Third International Joint Conference on Natural Language Processing (IJCNLP). Hyderabad, January 2008. http://faculty.washington.edu/wlewis2/papers/XiaLewis-ijcnlp08d-12.pdf

Xia, F. & Lewis, W. D. (2007), ‘Multilingual Structural Projection across Interlinearized Text’, in The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007), Rochester, NY, April 22-27, 2007. http://faculty.washington.edu/wlewis2/papers/xl-naacl07-16.pdf

Lewis, W. D. (2006), ODIN: A Model for Adapting and Enriching Legacy Infrastructure, in ‘Proceedings of the e-Humanities Workshop, held in cooperation with e-Science 2006: 2nd IEEE International Conference on e-Science and Grid Computing’, Amsterdam. http://faculty.washington.edu/wlewis2/papers/ODIN-eH06.pdf

Farrar, S. O. & Lewis, W. D. (2006), ‘The GOLD Community of Practice: An Infrastructure for Linguistic Data on the Web’, Language Resources and Evaluation. http://faculty.washington.edu/wlewis2/papers/FarLew06.pdf

Lewis, W. D. Xia, F. & Jinguji, D. (2006), ‘Enriching Language Data through Projected Structures’, in , The Workshop on Computational Linguistics for Less-studied Languages, organized by Texas Linguistics Society (TLSX), Austin, Texas, Nov 3-5, 2006. http://faculty.washington.edu/wlewis2/papers/LewisXiaJinguji06.pdf

Lewis, W. D., Farrar, S. & Langendoen, D. T. (2006), Linguistics in the internet age: Tools and fair use, in ‘Proceedings of the EMELD06 Workshop on Digital Language Documentation: Tools and Standards: The State of the Art’, Lansing, MI. http://emeld.org/workshop/2006/papers/lewis.pdf

Simons, G. F., Fitzsimons, B., Langendoen, D. T., Lewis, W. D., Farrar, S. O., Lanham, A., Basham, R. & Gonzalez, H. (2004), A model for interoperability: XML documents as an RDF database, in ‘Proceedings of the EMELD Workshop on Databases’, Detroit, MI. http://faculty.washington.edu/wlewis2/papers/Sim-etal04a.pdf

Simons, G. F., Lewis, W. D., Farrar, S. O., Langendoen, D. T., Fitzsimons, B. & Gonzalez, H. (2004), The semantics of markup: Mapping legacy markup schemas to a common semantics, in ‘Proceedings of the 4th workshop on NLP and XML (NLPXML2004): held in cooperation with ACL04’, Barcelona, Spain, pp. 25–32. http://faculty.washington.edu/wlewis2/papers/Sim-etal04b.pdf

Lewis, W. D. (2003), Mining and migrating interlinear glossed text, in ‘Proceedings of the EMELD Workshop on Digitizing and Annotating Texts and Field Recordings’, East Lansing, MI. http://emeld.org/workshop/2003/Lewis-paper.pdf

Farrar, S., Lewis, W. D., and Langendoen, D. T.  (2002a)  A common ontology for linguistic concepts, in 'Proceedings of the Knowledge Technologies Conference', Seattle, WA. http://faculty.washington.edu/wlewis2/papers/FarLewLang02a.pdf

Farrar, S., Lewis, W. D. & Langendoen, D. T. (2002b), An ontology for linguistic annotation, in ‘Semantic Web Meets Language Resources: Papers from the AAAI Workshop, Technical Report WS0216’, AAAI Press, Menlo Park, CA, pp. 11–19. http://faculty.washington.edu/wlewis2/papers/FarLewLang02b.pdf

Langendoen, D. T., Farrar, S. & Lewis, W. D. (2002), Bridging the markup gap: smart search engines for language researchers, in ‘Proceedings of the International Workshop on Resources and Tools in Field Linguistics’, Las Palmas, Gran Canaria, Spain. http://faculty.washington.edu/wlewis2/papers/LangFarLew02.pdf

Lewis, W. D., Farrar, S. & Langendoen, D. T. (2001), Building a knowledge base of morphosyntactic terminology, in ‘Proceedings of the IRCS Workshop on Linguistic Databases’, University of Pennsylvania, pp. 150–156. http://www.ldc.upenn.edu/annotation/database/papers/Langendoen etal/24.2.langendoen.pdf