Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added PTPN11 analysis #278

Draft
wants to merge 2 commits into
base: develop
Choose a base branch
from
Draft

Added PTPN11 analysis #278

wants to merge 2 commits into from

Conversation

justaddcoffee
Copy link
Member

Currently failing because of a problem finding NP ID in Uniprot

According to Ensembl, the canonical transcript of PTPN11 is ENST00000351677.7 which corresponds to transcript NM_002834.5 and protein NP_002825.3

When I run this cell:

from gpsea.model.genome import GRCh38
from gpsea.preprocessing import VVMultiCoordinateService, \
    configure_default_protein_metadata_service

txc_service = VVMultiCoordinateService(genome_build=GRCh38)
tx_coordinates = txc_service.fetch(PTPN11_transcript_id) 

pms = configure_default_protein_metadata_service()
protein_meta = pms.annotate(PTPN11_protein_id)

I get this error:

ValueError: Could not find an entry for NP_002825.3 in Uniprot response

Uniprot lists these as other NP IDs:

NP_001317366.1 NM_001330437.1 [Q06124-1]
NP_002825.3 NM_002834.4 [Q06124-2]
NP_542168.1 NM_080601.2 [Q06124-3]

But same problem with all of these NPs

@justaddcoffee justaddcoffee marked this pull request as draft September 21, 2024 16:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant