Svoboda | Graniru | BBC Russia | Golosameriki | Facebook

Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.

Retrieve / ID mapping

Tutorial/Video

Select the Retrieve/ID mapping tab of the toolbar and enter or upload a list of identifiers (or gene names) to do one of the following:

  • Retrieve the corresponding UniProt entries to download them or work with them on this website.
  • Convert identifiers which are of a different type to UniProt identifiers or vice versa, and download the identifier lists.

How to use this tool

  1. Enter identifiers or upload them from a file, separated by a space or a new line, into the form field, for example: P31946 P62258 ALBU_HUMAN
  2. If you need to convert to another identifier type (as performed previously by the “ID mapping” service), select the source and target type from the “From/To” dropdown menus under “Options”. Otherwise, to retrieve or download a list UniProtKB entries, keep the default selection of these menus (from UniProtKB AC/ID to UniProtKB)
  3. Click the Submit button.

The following kinds of UniProt identifiers are supported:

UniProtKBP00750UniProtKB entry
  P00750-2UniProtKB entry isoform sequence
  P00750[39-81]UniProtKB sequence range
  A4_HUMANUniProtKB entry name
UniParcUPI0000000001UniParc entry
UniRefUniRef100_P00750UniRef entry

When mapping from a source database external to UniProt, you can submit any identifier as used in the UniProtKB cross-references . If your job is not successful and you are not sure which source database to use, try a text search in UniProtKB with one of your identifiers, and look at an example entry. Check out the cross-reference section to find out which database uses these identifiers.

Further queries involving your UniProtKB data sets

After you have submitted your data, you are forwarded to a query result page showing the correspondence of submitted identifiers (from external databases, or obsolete UniProtKB identifiers) with current UniProtKB accession numbers. You can use the basket, download and align services like in any query result, as well as reconfigure the table layout (“Columns”) or add additional constraints to your query.

Jobs have unique identifiers, which (depending on the job type) can be used in queries (e.g. to get the intersection of two sequence similarity searches). Job identifiers and the related data are kept for 7 days, and are then deleted.

Unmapped identifiers

The list of identifiers that could not be mapped can be retrieved for further inspection or analysis.

When mapping popular sequence database identifiers such as RefSeq, gi numbers, EMBL, EMBLCDS to UniProtKB, unmapped identifiers can be further mapped to UniParc. This can be particularly useful for proteins from redundant proteomes.

Programmatic access

Code examples for programmatic access are available in the relevant API help pages:
Programmatic access – Mapping database identifiers
Programmatic access – Batch retrieval of entries

Notes

  • Very large mapping requests (>50,000 identifiers) are likely to fail. Please do verify that your list does not contain any duplicates, and try to split it into smaller chunks (<20,000) in case of problems. If you prefer to run your mapping locally, you can also download the data underlying this service.

See also: Related questions from our FAQ

Related terms: batch, bulk

UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again