Find-Health-Articles.com - making medical research available to everyone
Research article summary (published 2 Dec 2002):

Restoring accents in unknown biomedical words: application to the French MeSH thesaurus.

Full Abstract

In languages with diacritic marks, such as French, some textual or terminological resources are still available in electronic form only without diacritic marks, which hinders their use in natural language interfaces. In a specialized domain such as medicine, it is often the case that some words are not found in the available electronic lexicons. The issue of accenting unknown words then arises: it is the theme of this work. We propose two internal methods for accenting unknown words, both of which learn, from a reference set of accented words, the contexts of occurrence of the various accented forms of a given letter. One method is adapted from part-of-speech tagging, the other is based on finite state transducers. We show experimental results for the letter e on the French version of the Medical Subject Headings thesaurus. With the best training set, the tagging method obtains a precision-recall breakeven point of 84.2±4.4% and the transducer method 83.8±4.5% (with a baseline at 64%) for the unknown words that contain this letter. A consensus combination of both increases precision to 92.0±3.7% with a recall of 75%. We perform an error analysis and discuss further steps that might help improve on the current performance.
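The general idea of learning letter contexts from accented words can be illustrated with a minimal Python sketch. This is not the authors' tagging or transducer method: it is a simplified stand-in that counts, for each fixed-width character window around the letter e, which accented form appeared most often in training. The function names, the window widths, the "#" padding symbol, and the fallback to plain "e" for unseen contexts are all assumptions made for illustration. The `consensus` helper loosely mirrors the consensus combination in the abstract by answering only when two differently configured predictors agree.

```python
import unicodedata
from collections import Counter, defaultdict

# Forms of the letter "e" considered here (an assumption for this sketch).
E_FORMS = "eéèêë"


def strip_e_accents(word):
    """Replace every accented form of 'e' with plain 'e'."""
    return "".join("e" if ch in E_FORMS else ch for ch in word)


def contexts(word, width=2):
    """Yield (index, (left, right)) for each plain 'e' in an unaccented word."""
    padded = "#" * width + word + "#" * width  # '#' marks word boundaries
    for i, ch in enumerate(word):
        if ch == "e":
            j = i + width
            yield i, (padded[j - width:j], padded[j + 1:j + 1 + width])


def train(accented_words, width=2):
    """Count which form of 'e' occurs in each context of the reference set."""
    table = defaultdict(Counter)
    for raw in accented_words:
        # NFC keeps one code point per accented letter, so indices line up.
        word = unicodedata.normalize("NFC", raw)
        plain = strip_e_accents(word)
        for i, ctx in contexts(plain, width):
            table[ctx][word[i]] += 1
    return table


def accent(word, table, width=2):
    """Accent an unaccented word; unseen contexts keep plain 'e' (assumption)."""
    out = list(word)
    for i, ctx in contexts(word, width):
        if ctx in table:
            out[i] = table[ctx].most_common(1)[0][0]
    return "".join(out)


def consensus(word, table_a, width_a, table_b, width_b):
    """Return a prediction only when both predictors agree, else None."""
    a = accent(word, table_a, width_a)
    b = accent(word, table_b, width_b)
    return a if a == b else None


if __name__ == "__main__":
    reference = ["hépatite", "hépatique", "hématome"]  # toy training set
    wide = train(reference, width=2)
    narrow = train(reference, width=1)
    print(accent("hepatome", wide, width=2))
    print(consensus("hepatome", wide, 2, narrow, 1))
```

Abstaining on disagreement trades recall for precision, which is the direction of the effect the abstract reports for its consensus combination; the toy data above is far too small to say anything about actual performance.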

 

Author information

Author/s: Zweigenbaum, Pierre (P); Grabar, Natalia (N);

Affiliation: Mission de Recherche en Sciences et Technologies de l'Information Médicale, STIM/DSI, Assistance Publique, Hôpitaux de Paris, STIM CHU Pitié-Salpêtrière, 91 boulevard de l'Hôpital, 75634 Paris Cedex 13 France. pz@biomath.jussieu.fr

Journal and publication information

Publication Type: Journal Article

Journal: International Journal of Medical Informatics (Int J Med Inform), published in Ireland. (Language: eng)

Reference: 2002-Dec; vol 67 (issue 1-3) : pp 113-26

Dates: Created 2002/12/03; Completed 2003/04/28; Revised 2004/11/17;

PMID: 12460636, status: MEDLINE (last retrieval date: 11/6/2008)

Sourced from the National Library of Medicine. Abstract text and other information may be subject to copyright.
