Third-party packages

This page lists third-party packages/modules built upon services.

MyGene python module

mygene” is an easy-to-use Python wrapper to access services.

You can install it easily using either pip or easy_install:

pip install mygene               #this is preferred


easy_install mygene

This is a brief example:

In [1]: import mygene

In [2]: mg = mygene.MyGeneInfo()

In [3]: mg.getgene(1017)
{'_id': '1017',
 'entrezgene': 1017,
 'name': 'cyclin-dependent kinase 2',
 'symbol': 'CDK2',
 'taxid': 9606}

In [4]:  mg.query('cdk2', size=2)
{'hits': [{'_id': '1017',
   '_score': 373.24667,
   'entrezgene': 1017,
   'name': 'cyclin-dependent kinase 2',
   'symbol': 'CDK2',
   'taxid': 9606},
  {'_id': '12566',
   '_score': 353.90176,
   'entrezgene': 12566,
   'name': 'cyclin-dependent kinase 2',
   'symbol': 'Cdk2',
   'taxid': 10090}],
 'max_score': 373.24667,
 'took': 10,
 'total': 28}

See for more details.

MyGene autocomplete widget

This autocomplete widget for gene query (built upon JQueryUI’s autocomplete widget) provides suggestions while you type a gene symbol or name into the field. You can easily embed it into your web application. It also provides many customization options for your different use-cases.

See for more details.

You can also play with this jsFiddle example:

Another MyGene Python wrapper

This is yet another Python wrapper of services created by Brian Schrader. It’s hosted at

It’s available from PyPI as well:

pip install mygene-api

Some basic examples:

  • Find a given gene with the id: CDK2.
""" Use the query API to find a gene with
the given symbol.
from mygene.gene import Gene

results = Gene.find_by(q='CDK2')
for r in result:
    print r._id,

>>> 1017 cyclin-dependent kinase 2
12566 cyclin-dependent kinase 2
362817 cyclin dependent kinase 2
52004 CDK2-associated protein 2
  • Given an known gene, get it’s begin and end coordinates.
""" Use the annotation API to find the full
details of a given gene.
from mygene.gene import gene

gene = Gene.get('1017')
print gene._id, gene.genomic_pos_hg19['start'], gene.genomic_pos_hg19['end']

>>> 1017 56360553 56366568
  • This library also supports the metadata API.
from mygene.metadata import Metadata

metadata = Metadata.get_metadata()
print metadata.stats['total_genes']

>>> 12611464