|
|
|
|
LEADER |
01000naa a22002652 4500 |
001 |
NLM291359140 |
003 |
DE-627 |
005 |
20231225070716.0 |
007 |
cr uuu---uuuuu |
008 |
231225s2018 xx |||||o 00| ||eng c |
024 |
7 |
|
|a 10.1080/02664763.2018.1426741
|2 doi
|
028 |
5 |
2 |
|a pubmed24n0971.xml
|
035 |
|
|
|a (DE-627)NLM291359140
|
035 |
|
|
|a (NLM)30505061
|
040 |
|
|
|a DE-627
|b ger
|c DE-627
|e rakwb
|
041 |
|
|
|a eng
|
100 |
1 |
|
|a Du, Ruofei
|e verfasserin
|4 aut
|
245 |
1 |
0 |
|a Statistical correction for functional metagenomic profiling of a microbial community with short NGS reads
|
264 |
|
1 |
|c 2018
|
336 |
|
|
|a Text
|b txt
|2 rdacontent
|
337 |
|
|
|a ƒaComputermedien
|b c
|2 rdamedia
|
338 |
|
|
|a ƒa Online-Ressource
|b cr
|2 rdacarrier
|
500 |
|
|
|a Date Revised 20.11.2019
|
500 |
|
|
|a published: Print-Electronic
|
500 |
|
|
|a Citation Status PubMed-not-MEDLINE
|
520 |
|
|
|a By sequence homology search, the list of all the functions found and the counts of reads being aligned to them present the functional profile of a metagenomic sample. However, a significant obstacle has been observed in this approach due to the short read length associated with many next generation sequencing technologies. This includes artificial families, cross-annotations, length bias and conservation bias. The widely applied cutoff methods, such as BLAST E-value, are not able to solve the problems. Following the published successful procedures on the artificial families and the cross-annotation issue, we propose in this paper to use zero-truncated Poisson and Binomial (ZTP-Bin) hierarchical modelling to correct the length bias and the conservation bias. Goodness-of-fit of the modelling and cross-validation for the prediction using a bioinformatic simulated sample show the validity of this approach. Evaluated on an in vitro-simulated data set, the proposed modelling method outperforms other traditional methods. All three steps were then sequentially applied on real-life metagenomic samples to show that the proposed framework will lead to a more accurate functional profile of a short read metagenomic sample
|
650 |
|
4 |
|a Journal Article
|
650 |
|
4 |
|a Primary 62F10
|
650 |
|
4 |
|a conservation bias
|
650 |
|
4 |
|a functional profiling
|
650 |
|
4 |
|a length bias
|
650 |
|
4 |
|a metagenomics
|
650 |
|
4 |
|a secondary 62P10
|
650 |
|
4 |
|a short reads
|
700 |
1 |
|
|a Fang, Zhide
|e verfasserin
|4 aut
|
773 |
0 |
8 |
|i Enthalten in
|t Journal of applied statistics
|d 1991
|g 45(2018), 14 vom: 01., Seite 2521-2535
|w (DE-627)NLM098188178
|x 0266-4763
|7 nnns
|
773 |
1 |
8 |
|g volume:45
|g year:2018
|g number:14
|g day:01
|g pages:2521-2535
|
856 |
4 |
0 |
|u http://dx.doi.org/10.1080/02664763.2018.1426741
|3 Volltext
|
912 |
|
|
|a GBV_USEFLAG_A
|
912 |
|
|
|a SYSFLAG_A
|
912 |
|
|
|a GBV_NLM
|
912 |
|
|
|a GBV_ILN_350
|
951 |
|
|
|a AR
|
952 |
|
|
|d 45
|j 2018
|e 14
|b 01
|h 2521-2535
|