About: On InChI and evaluating the quality of cross-reference links

An Entity of Type : http://linked.opendata.cz/ontology/domain/vavai/Vysledek, within Data Space : linked.opendata.cz associated with source document(s)

Attributes	Values
rdf:type	skos:Concept http://linked.opendata.cz/ontology/domain/vavai/Vysledek
rdfs:seeAlso	http://www.jcheminf.com/content/6/1/15
Description	There are many databases of small molecules focused on different aspects of research and its applications. Some tasks may require integration of information from various databases. However, determining which entries from different databases represent the same compound is not straightforward. Integration can be based, for example, on automatically generated cross-reference links between entries. Another approach is to use the manually curated links stored directly in databases. This study employs well-established InChI identifiers to measure the consistency and completeness of the manually curated links by comparing them with the automatically generated ones. We used two different tools to generate InChI identifiers and observed some ambiguities in their outputs. In part, these ambiguities were caused by indistinctness in interpretation of the structural data used. InChI identifiers were used successfully to find duplicate entries in databases. We found that the InChI inconsistencies in the manually curated links are very high (28.85% in the worst case). Even using a weaker definition of consistency, the measured values were very high in general. The completeness of the manually curated links was also very poor (only 93.8% in the best case) compared with that of the automatically generated links. We observed several problems with the InChI tools and the files used as their inputs. There are large gaps in the consistency and completeness of manually curated links if they are measured using InChI identifiers. However, inconsistency can be caused both by errors in manually curated links and the inherent limitations of the InChI method. (en)
Title	On InChI and evaluating the quality of cross-reference links (en)
skos:prefLabel	On InChI and evaluating the quality of cross-reference links (en)
skos:notation	RIV/61388963:_____/14:00429452!RIV15-AV0-61388963
http://linked.open...avai/riv/aktivita	I P
http://linked.open...avai/riv/aktivity	I, P(LH11020)
http://linked.open...iv/cisloPeriodika	Apr 17
http://linked.open...vai/riv/dodaniDat	2015
http://linked.open...aciTvurceVysledku	Vondrášek, Jiří; Galgonek, Jakub
http://linked.open.../riv/druhVysledku	J - Article in a scholarly periodical
http://linked.open...iv/duvernostUdaju	S - Complete and true data not subject to protection under special legal regulations
http://linked.open...titaPredkladatele	Ústav organické chemie a biochemie AV ČR, v. v. i.
http://linked.open...dnocenehoVysledku	34302
http://linked.open...ai/riv/idVysledku	RIV/61388963:_____/14:00429452
http://linked.open...riv/jazykVysledku	eng - English
http://linked.open.../riv/klicovaSlova	databases; system; file (en)
http://linked.open.../riv/klicoveSlovo	file; databases; system
http://linked.open...odStatuVydavatele	GB - United Kingdom of Great Britain and Northern Ireland
http://linked.open...ontrolniKodProRIV	[78D17CDEDE50]
http://linked.open...i/riv/nazevZdroje	Journal of Cheminformatics
http://linked.open...in/vavai/riv/obor	CF
http://linked.open...ichTvurcuVysledku	2 (xsd:int)
http://linked.open...cetTvurcuVysledku	2 (xsd:int)
http://linked.open...vavai/riv/projekt	Systematic mapping of the conformational space of short peptides through molecular dynamics simulation - a way to understanding of protein structure formation.
http://linked.open...UplatneniVysledku	2014
http://linked.open...v/svazekPeriodika	6
http://linked.open...iv/tvurceVysledku	Vondrášek, Jiří; Galgonek, Jakub
http://linked.open...ain/vavai/riv/wos	000335606300001
issn	1758-2946
number of pages	15 (xsd:int)
http://bibframe.org/vocab/doi	10.1186/1758-2946-6-15
