Skip to content
Snippets Groups Projects
Commit a89fcf46 authored by Vermaat's avatar Vermaat
Browse files

In LRG, only support one CDS per transcript (ignore all others)

The LRG model allows for more than one CDS per transcript (and each CDS
can be coupled with more than one transcript). The Mutalyzer gene model
does not allow that.

Previously, multiple CDSs per transcript where treated incorrectly (I
think), so now we just ignore any CDS except the first per transcript.

Unfortunately, it is a bit more involved to give good warnings for this,
so that is not implemented.
parent 5c1e01dc
No related branches found
No related tags found
No related merge requests found
......@@ -192,7 +192,16 @@ def create_record(data):
# ending position, keep the possibility in mind that multiple CDS
# regions are given
CDSPList = GenRecord.PList()
for CDS in tData.getElementsByTagName("coding_region"):
for cds_id, CDS in enumerate(tData.getElementsByTagName("coding_region")):
if cds_id > 0:
# Todo: For now, we only support one CDS per transcript and
# ignore all others.
# By the way, I don't think the loop and sorting of CDS
# positions makes any sense here, but I leave it in place
# and just ignore everything except the first iteration.
#translationName = CDS.getElementsByTagName("translation")[0].getAttribute("name").encode("utf8")[1:]
#print 'Ignoring transcript %s translation %s' % (transcriptName, translationName)
continue
coordinates = _get_coordinates(CDS, lrg_id)
CDSPList.positionList.extend(\
[int(coordinates["start"]), int(coordinates["end"])])
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment