Skip to search formSkip to main contentSkip to account menu
DOI:10.1101/2021.06.03.446934 - Corpus ID: 235348928
@article{Cantalapiedra2021eggNOGmapperVF, title={eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale}, author={Carlos P. Cantalapiedra and Ana Hern{\'a}ndez-Plaza and Ivica Letunic and Peer Bork and Jaime Huerta-Cepas}, journal={Molecular Biology and Evolution}, year={2021}, volume={38}, pages={5825 - 5829}, url={https://api.semanticscholar.org/CorpusID:235348928}}
- C. Cantalapiedra, Ana Hernández-Plaza, J. Huerta-Cepas
- Published in bioRxiv 3 June 2021
- Computer Science, Biology, Environmental Science
A major upgrade to eggNOG-mapper, a tool for functional annotation based on precomputed orthology assignments, now optimized for vast (meta)genomic data sets, is described.
1,268 Citations
156
70
403
1
Topics
eggNOG-Mapper (opens in a new tab)Functional Annotation (opens in a new tab)Genomics (opens in a new tab)Genome (opens in a new tab)
1,268 Citations
- Fabian RupertiNikolaos PapadopoulosJ. MusserD. Arendt
- 2022
Computer Science, Biology
This work demonstrates that structural similarity is more powerful to generate functional prediction and annotation than sequence alone, bridging longer evolutionary distances.
- Highly Influenced
- PDF
- Fabian RupertiNikolaos PapadopoulosJ. MusserM. MirditaMartin SteineggerD. Arendt
- 2023
Computer Science, Biology
Genome Biology
It is demonstrated that structural similarity is a powerful approach that complements and extends sequence similarity searches to identify hom*ologous proteins over long evolutionary distances, and will be a powerful approach that boosts discovery in numerous -omics datasets, especially for non-model organisms.
- 12
- Highly Influenced
- PDF
- Ana Hernández-PlazaDamian Szklarczyk J. Huerta-Cepas
- 2023
Biology, Computer Science
Nucleic Acids Res.
The eggNOG 6.0 website contains new functionality to mine orthology and functional data with ease, including the possibility of generating phylogenetic profiles for multiple OGs across species or identifying single-copy OGs at custom taxonomic levels.
- 61 [PDF]
- C. FarkasA. Recabal T. Caprile
- 2022
Biology, Computer Science
GigaScience
An easy-to-use genome-guided transcriptome annotation pipeline that uses assembled transcripts from hybrid sequencing data as input and distinguishes between coding and long non-coding RNAs by integration of several bioinformatic approaches, including gene reconciliation with previous annotations in GTF format is developed.
- 1
- PDF
- Gemma I. Martínez-RedondoCarlos Vargas-ChávezKlara EleftheriadiLisandra Benítez-ÁlvarezMarçal Vázquez-VallsRosa Fernández
- 2024
Biology, Environmental Science
bioRxiv
The newest version of MATEdb (MATEdb2) is presented, a repository of high-quality transcriptomic and genomic data for the two most diverse animal phyla that overcomes some of the previous limitations of the database and provides gene annotations from genomes obtained using the same pipeline.
- 1
- PDF
- D. AlbaneseC. ColeineL. SelbmannC. Donati
- 2023
Environmental Science, Biology
bioRxiv
This work presents metashot/aweMAGs, a fully automated workflow capable of performing quality assessment, annotation and phylogenomic placement of eukaryotic MAGs, and can run out-of-the-box on any platform that supports Docker, Singularity and Nextflow, including computing clusters or batch systems in the cloud.
- PDF
- Nicholas P CooleyE. Wright
- 2021
Computer Science, Biology
NAR genomics and bioinformatics
A new approach for classifying proteins into a taxonomy of functions and IDTAXA is developed, which was more accurate than BLAST or HMMER at assigning sequences to KEGG ortholog groups and demonstrated its utility for genome annotation.
- Rosa FernándezVanina Tonzo Carlos Vargas-Chávez
- 2022
Biology, Computer Science
bioRxiv
This work presents Metazoan Assemblies from Transcriptomic Ensembles (MATEdb), a curated database of 335 high-quality transcriptome assemblies from different animal phyla analyzed following the same pipeline, totalling 423 high- quality genomic and transcriptomic data sets.
- 11
- PDF
- J. C. SequeiraMiguel RochaM. AlvesA. Salvador
- 2022
Computer Science, Biology
Computational and structural biotechnology…
- 10
- PDF
- Aysun UrhanBianca-Maria CosmaA. EarlA. MansonT. Abeel
- 2023
Computer Science, Biology
bioRxiv
To predict gene functions in bacteria, SAP is developed, a novel synteny-aware gene function prediction tool based on protein embeddings from state-of-the-art protein language models that leverages the unique operon structure of bacteria through conserved synteny.
- PDF
...
...
19 References
- J. Huerta-CepasK. Forslund P. Bork
- 2016
Computer Science, Biology
bioRxiv
EggNOG-mapper, a tool for functional annotation of large sets of sequences based on fast orthology assignments using precomputed clusters and phylogenies from eggNOG, benchmarked Gene Ontology predictions against two widely used hom*ology-based approaches: BLAST and InterProScan.
- 1,872
- PDF
- J. Huerta-CepasDamian Szklarczyk P. Bork
- 2019
Biology, Environmental Science
Nucleic Acids Res.
Abstract eggNOG is a public database of orthology relationships, gene evolutionary histories and functional annotations. Here, we present version 5.0, featuring a major update of the underlying…
- 2,324 [PDF]
- D. MendeIvica Letunic P. Bork
- 2020
Biology, Computer Science
Nucleic Acids Res.
The presented proGenomes2 provides threefold more genomes, enhanced habitat annotations, updated taxonomic and functional annotation and improved linkage to the NCBI BioSample database.
- 65 [PDF]
- C. J. NorsigianNeha Pusarla Zachary A. King
- 2020
Biology, Computer Science
Nucleic Acids Res.
New content and features in the BiGG Models repository are detail, continuing the original effort to connect each model to genome annotations and external databases as well as standardization of reactions and metabolites.
- 168 [PDF]
- Natasha M. GloverC. Dessimoz P. Thomas
- 2019
Biology, Computer Science
Molecular biology and evolution
The recent advances and outstanding challenges in orthology inference are reviewed, as revealed at a symposium and meeting held at the University of Southern California in 2017.
- 64 [PDF]
- Stefan Go¨tzJuan Miguel Garcı´a-Go´mez Ana Conesa
Computer Science, Biology
The Blast2GO framework is used to carry out a detailed analysis of annotation behaviour through hom*ology transfer and its impact in functional genomics research to offer biologists useful information to take into account when addressing the task of functionally characterizing their sequence data.
- 3,116
- PDF
- Jaina MistryR. FinnS. EddyA. BatemanM. Punta
- 2013
Biology, Computer Science
Nucleic acids research
It is found that HMMER3 E-value estimates seem to be less accurate for families that feature periodic patterns of compositional bias, such as the ones typically observed in coiled-coils, which supports the continued use of manually curated inclusion thresholds in the Pfam database.
- 1,107 [PDF]
- Philip JonesDavid Binns S. Hunter
- 2014
Computer Science, Biology
Bioinform.
A new Java-based architecture for the widely used protein function prediction software package InterProScan is described, resulting in a flexible and stable system that is able to use both multiprocessor machines and/or conventional clusters to achieve scalable distributed data analysis.
- 5,771 [PDF]
- T. G. O. Consortium
- 2019
Biology, Computer Science
Nucleic Acids Res.
GO-CAM, a new framework for representing gene function that is more expressive than standard GO annotations, has been released, and users can now explore the growing repository of these models.
- 3,188
- PDF
- R. TatusovMichael Y. GalperinD. NataleE. Koonin
- 2000
Biology, Computer Science
Nucleic Acids Res.
The database of Clusters of Orthologous Groups of proteins (COGs) is an attempt on a phylogenetic classification of the proteins encoded in 21 complete genomes of bacteria, archaea and eukaryotes.
- 3,764
- PDF
...
...
Related Papers
Showing 1 through 3 of 0 Related Papers