Unidad de Minería de Textos en Biología

Inicio | Investigación e innovación | Programas Científicos | Programa de Biología Estructural | Unidad de Minería de Textos en Biología

Vacante Director

Biomedical cancer research is a particularly data-heavy discipline, where key information sources are not only limited to genomic information or raw experimental data. Especially unstructured data, such as the scientific literature, clinical texts, medicinal chemistry patents or patient generated content, constitute a valuable resource for a range of scenarios like drug discovery, interpretation of large scale experimental results, drug repurposing or evidence based medicine. Medical big data approaches are only able to efficiently exploit running texts through the use of natural language processing (NLP) techniques relying on deep learning and artificial intelligence strategies. Our Unit is financed through the Plan for the Advancement of Language Technologies; the aim is to generate resources that can improve the exploitation of biomedical data by means of implementing and evaluating the underlying quality of systems for automatic recognition of medical concepts, generation of specialised neural machine translation models for the medical domain and the implementation of a medical language technology platform and software components for processing Spanish EHRs.