Site Caderige

CADERIGE

Bibliographie & Références

Publications des membres du projet

Publications dans des conférences

BESSIERES P., NAZARENKO A., NEDELLEC C. (2001).Apport de l'apprentissage à l'extraction d'information : le problème de l'identification d'interactions géniques. A paraitre dans les actes de CIDE 2001, 4e Colloque International sur le Document Electronique. Toulouse 24-26 oct. 2001, France. (PFD)

BISSON G., NEDELLEC C., CAÑAMERO L. (2000). Designing clustering methods for ontology building: The Mo'K workbench. Ontology Learning workshop (ECAI 2000), Berlin, 22 août 2000.

BISSON G. ET NEDELLEC C. (2001). Aide à la conception de méthodes de classification pour la construction d'ontologies : l'atelier Mo'K. In H. Brian Eds. Actes des Journées Francophones d'Extraction et de Gestion des Connaissances (EGC 2001), Hermès (Pub.) Nantes. (PDF)

NEDELLEC C., (2002). Ontology learning for Information Extraction in functional genomics : the Caderige project. Workshop Semantic Web ECML/PKDD 2002. (PDF)

NEDELLEC C., OULD ABDEL VETAH M., BESSIERES P., BRUN C., JACQ J. Reconnaître les fragments de phrases pertinents pour l'extraction d'information dans les textes de génomique, un problème de classification. Actes de la Conférence francophone d'Apprentissage (CAP 2001), PUG (eds). (PDF)

NEDELLEC C. ET NAZARENKO A. (2001). Application de l'apprentissage à la recherche et à l'extraction d'information - Un exemple, le projet Caderige : identification d'interactions géniques. In Actes de la Journée thématique Exploration de données issues d'Internet organisée le 2 mars 2001 au LIPN. Bennani Y., et al. (Eds). (PDF)

NEDELLEC C. ET OULD ABDEL VETAH M. (2001). Modélisation des interactions géniques à partir de textes. Journée Post-Génomique de la Doua (JPGD), Lyon.

NEDELLEC C. ET OULD ABDEL VETAH M., BESSIERES P. (2001). Sentence Filtering for Information Extraction in Genomics, a Classification Problem. To appear in proceeding of 5th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD 01). September 3-7, 2001, Freiburg, Germany (PDF)

Présentations du projet

Présentation du projet CADERIGE aux journées organisées les 26 et 27 avril 2001 par XRCE et l'INRIA Rhône-Alpes sur le thème : "Les ontologies en biologie moléculaire et l'extraction d'informations à partir de textes". Ces journées s'inscrivaient dans le cadre de l'action IMPG ("Informatique, Mathématique et Physique pour la Génomique"). (PDF des transparents).

Présentation du projet aux journées BioInformatique organisées les 14-15 octobre à Paris au ministère de la Recherche. (PDF des transparents)

Synthèses & Documents de travail

NEDELLEC C. Synthèse bibliographique sur l'application de l'apprentissage à la recherche d'information. (PDF)

Publication de membres associés au projet

POIBEAU T. (2001). Extraction d'information dans les bases de données textuelles en génomique au moyen de transducteurs à nombre fini d'états. Conférence Terminologie et Intelligence Artificielle (TIA 2001).

Références bibliographiques diverses

ANDRADE M.A., VALENCIA A. (1998). Automatic extraction of keywords from scientific text: application to the knowldge domain of proteins families. BioInformatics volume 14 n°7. Page 600-607. (PDF)

BLASCHKE C., ANDRADE M. A., OUZOUNIS C. AND VALENCIA A. (2001). Automatic Extraction of biological information from scientific text: protein-protein interactions. In Proceedings of International Symposium on Molecular Biology, (ISMB'99).

BRILL E. (1992). A simple rule-based part of speech tagger. In Proceedings of the Third Conference on Applied Natural Language Processing, ACL. 31 March - 3 April 1992. Trento, Italy.

COLLIER N, NOBATA C. AND TSUJII (2000). Extracting the names of genes and gene products with a hidden Markov model. In Proceedings of the 18th International Conference on Computational Linguistics (COLING'2000), Saarbrück, Allemagne.

CRAVEN M. AND KUMLIEN J.(1999). Constructing Biological Knowledge Bases by Extracting Information from Text Sources., In Proceedings of the 7th International Conference on Intelligent Systems for Molecular Biology (ISMB-99).

DAILLE B. (1994). Approche mixte pour l'extraction de terminologie : statistique lexicale et filtres linguistiques. Thèse d'Informatique. Université de Paris VII. Février 1994.

FAURE D., NEDELLEC C. (1998). ASIUM: Learning subcategorization frames and restrictions of selection. In Yves Kodratoff, editor, 10th European Conference on Machine Learning (ECML 98) -- Workshop on Text Mining, Avril 1998. (PS)

FUKUDA K., TSUNODA T., TAMURA A. AND TAKAGI T. (1998). Toward Information Extraction: Identifying protein names from biological papers. In Proceedings of the Pacific Symposium on biocomputing (PSB'1998). (PDF)

HALDENWANG W. G. (1995). The sigma factors of Bacillus subtilis. Microbiol. Rev. vol 59, p. 1-30.

HUMPHREYS K., DEMETRIOU G, AND GAIZAUSKAS R. (2000). Two applications of information extraction to biological science article: enzyme interaction and protein structure. In Proceedings of the Pacific Symposium on biocomputing (PSB'2000), vol.5, p. 502-513, Honolulu.

MARCOTTE E.M., XENARIOS L., EISENBERG D. (2001). Mining literature for protein-protein interactions. In BioInformatics, Volume 17 n° 4. Page 359-363. (PDF)

MITCHELL, T.M. (1997). Machine Learning, Mac Graw Hill, 1997.

ONO T., HISHIGAKI H., TANIGAMI A., AND TAKAGI T. (2001). Automated extraction of information on protein-protein interactions from the biological literature. In Bioinformatics, vol 17 no 2 2001, pp. 155-161. (PDF)

PILLET V. (2000). Méthodologie d'extraction automatique d'information à partir de la littérature scientifique en vue d'alimenter un nouveau système d'information. Thèse de l'Université de droit, d'économie et des sciences d'Aix-Marseille.

PROUX, D., RECHENMANN, F., JULLIARD, L., PILLET, V., JACQ, B. (1998). Detecting Gene Symbols and Names in Biological Texts: A First Step toward Pertinent Information Exctraction. In Genome Informatics, S. Miyano and T. Takagi, (Eds), Universal Academy Press, Inc, Tokyo, Japan, p. 72 - 80.

QUINLAN J.R. (1992). C4.5: Programs for Machine Learning, Morgan Kaufmann.

STAPLEY B. J., BENOIT G. (2000). Bibliometrics: Information Retrieval and Visualization from co-occurrence of gene names in MedLine abstracts. In Proceedings of the Pacific Symposium on biocomputing (PSB'2000), 2000.

TAMANES J., OUZOUNIS C., CASARI G., SANDER C., VALENCIA A.(1998). EUCLID: automatic classification of protéeins in functional classes by their database annotations. In BioInformatics Applications Note. Volume 14 n° 6. Page 542-543. (PDF)

THOMAS, J., MILWARD, D., OUZOUNIS C., PULMAN S. AND CAROLL M. (2000). Automatic Extraction of Protein Interactions from Scientific Abstracts. In Proceedings of the Pacific Symposium on biocomputing (PSB'2000), vol.5, p. 502-513, Honolulu.

WOSTEN M. M. (1998). Eubacterial sigma-factors. FEMS Microbiol. Rev. vol 3, 127-50.

YANG Y.AND PEDERSEN J. (1997). a comparative study on feature selection in text categorization. In International Conference on ML.

YOSHIDA M., FUDUKA K., TAKAGI T. (2000). PNAD-CSS: a workbench for constructing a protein name abbreviation dictionary. In BioInformatics Ontology. Volume 16 n° 2. Page 169-175. (PDF)

Dernière modification : 10/03/05, gb