MCPD in ICIS
From ICISWiki
Mapping the Multi-Crop Passport Descriptors (MCPD) onto ICIS
The multi-crop passport descriptors are an agreed international standard, in flat file format (one row per accession, one column per field) for sharing passport data that are common to all crops.
These passport data are stored in several different tables in the GMS of ICIS. This page maps the MCPD onto to the corresponding fields in ICIS.
The accession, the donor's sample, and the collected or bred sample
A major structural difference between MCPD and ICIS is the handling of donor's sample, the collected sample and the bred sample.
- In MCPD, one record corresponds to one accession, and information about the donor's sample, collected sample and bred sample is recorded as different fields of that record.
- In ICIS, up to three GIDs are used for each accession:
- GID for the accession itself;
- GID for the sample held by the donor, pointed to by GPID2 of the accession GID; and
- GID for the original cross or collected sample, pointed to by GPID1 of the accession GID.
To retrieve them in IRIS, 3 instances of the GERMPLSM, NAMES, ATRIBUTS and LOCATION tables are needed, joined by GPID1 and GPID2. In this page, the three instances are referred to as:
- g, n, a, L = instance of germplasm, names, atributs or location tables used for the accession itself, joined by g.GID=n.GID etc
- g1, n1, a1, L1 = instance of germplasm, names, atributs or location tables used for the original cross or collected sample, joined by g.GPID1=g1.GID and g1.GID=n1.GID etc
- g2, n2, a2, L2 = instance of germplasm, names, atributs or location tables used for the donor's sample, joined by g.GPID2=g2.GID and g2.GID=n2.GID etc
The Map
| MCPD field | Brief description | Equivalent in IRIS |
|---|---|---|
| 01. InstCode | FAO Code of the institute maintaining the sample | institut.faocode where institut.locid=g.glocn |
| 02. AcceNumb | "Accession number" as the sample's unique ID. | n.nval where n.nstat=8 (i.e. the name is a preferred ID) For data strictly on genebank accessions, also restrict to names.ntype=1 (i.e. the name is the ID of a genebank accession); however this restriction may be relaxed if other types of material are to be included, e.g. breeding lines or entries in international nurseries. Also if we are including non-genebank accessions there may be GIDs that do not have a name with nstat=8. In this case, we use the "ICIS unique ID", which in the case of rice = "IRIS" & g.germuid & g.lgid |
| 03. CollNumb | Collecting number = sample ID assigned by collector when collected | (Only for a GID derived by derivative or maintenance methods from an original collected sample, i.e. where g1.methn=69) n1.nval where n1.nstat=8 and n1.ntype=9 and g1.methn=69 |
| 04. CollCode | Collecting institute code | (Only for a GID derived by derivative or maintenance methods from an original collected sample, i.e. where g1.methn=69) institute.faocode where g1.gid=listdata.gid and listdata.listid=listnms.listid and teammember.listid=listdata.listid and listnms.listtype=collmiss and teammember.personid=persons.personid and institute.institid=person.personid |
| 05. Genus 06. Species 07. SpAuthor 08. Subtaxa 09. SubtAuth | ICIS doesn't (yet) handle taxonomy as separate fields. Instead these fields are merged into a single string as an attribute | If (a.gid=g.GID and a.AID=1141) is not null (i.e. if the accession has an attribute 1141), scaledis.valdesc where scaledis.scaleid=532 and a.aval=scaledis.value and a.gid=g.GID and a.AID=1141 If the GID has no attribute with AID=1141, it may be possible to put a default for the database. For IRIS, we assume "Oryza sativa (L.)" for missing attribute 1141. |
| 10. CropName | Common crop name | If (a.gid=g.GID and a.AID=1133) is not null (i.e. if the accession has an attribute 1133), scaledis.valdesc where scaledis.scaleid=526 and a.aval=scaledis.value and a.gid=g.GID and a.AID=1133 If the GID has no attribute with AID=1133, it may be possible to put a default for the database. For IRIS, this is "rice" Append n.nval where n.ntype=23 |
| 11. AcceName | Accession name | Start with the preferred name (n.nval where n.nstat=1). If there are non-preferred names that are not included elsewhere in the MCPD (i.e. n.nstat=0 and n.ntype= one of [4,5,6,7,10,11,12,13,15,16,17,20,24] ), concatenate them, separating successive names with ";". |
| 12. AcqDate | Acquisition date | g.GDATE |
| 13. OrigCty | Country of origin | L1C.lname where L1C.locid=L1.cntryID and L1.locid=g1.glocn (where L1C and L1 are two separate instances of the LOCATION table) |
| 14. CollSite | Location of collecting site | (Only for a GID derived by derivative or maintenance methods from an original collected sample, i.e. where g1.methn=69): Concatenation of L1.lname, L1T.lname, L1D.lname, L1P.lname where L1.locid=g1.glocn and L1.snl3id=L1T.locid and L1.snl2id=L1D.locid and L1.snl1id=L1P.locid (where L1P, L1D, L1T and L1 are four separate instances of the location table, documenting respectively the province, district, town and location) |
| 15. Latitude | Latitude of collecting site | (Only for a GID derived by derivative or maintenance methods from an original collected sample, i.e. where g1.methn=69) georef.lat where georef.locid=g1.locn and g1.methn=69 |
| 16. Longitud | Longitude of collecting site | (Only for a GID derived by derivative or maintenance methods from an original collected sample, i.e. where g1.methn=69) georef.lon where georef.locid=g1.locn and g1.methn=69 |
| 17. Elevatio | Altitude of collecting site | (Only for a GID derived by derivative or maintenance methods from an original collected sample, i.e. where g1.methn=69) georef.alt where georef.locid=g1.locn and g1.methn=69 |
| 18. CollDate | Collecting date of sample | (Only for a GID derived by derivative or maintenance methods from an original collected sample, i.e. where g1.methn=69) g1.gdate where g1.methn=69 |
| 19. BredCode | Breeding institute code |
(Only for a GID derived by derivative or maintenance methods from a cross, i.e. where g1.methn has a GEN type) |
| 20. SampStat | Biological status of accession | If (a.gid=g.GID and a.AID=1144) is not null (i.e. if the accession has an attribute 1144), scaledis.valdesc where scaledis.scaleid=532 and a.aval=scaledis.value Otherwise, if (n.ntype=4) is not null or (n.ntype=20) is not null, put "Improved cultivar" Otherwise, if methods.mtype="DER" where methods.mid=g.methn, put "Breeding line" Otherwise null |
| 21. Ancest | Pedigree | (Only for a GID derived by derivative or maintenance methods from a cross, i.e. where g1.methn has a GEN type) Constructed dynamically from the pedigree in ICIS |
| 22. CollSrc | Collecting/acquisition source | (Only for a GID derived by derivative or maintenance methods from an original collected sample, i.e. where g1.methn=69) scaledis.valdesc where scaledis.scaleid=527 and a.aval=scaledis.value and a.gid=g.GID and a.AID=1135 |
| 23. DonorCode | Donor institute code | institut.faocode where institut.locid=g2.glocn |
| 24. DonorNum | Donor accession number | n2.nval where n2.nstat=8 |
| 26. DuplSite | Location of safety duplicates | (This is information in IRIS-GRIMS is present only in the local database, not publicised in central) |
| 27. Storage | Type of germplasm storage | (This is information in IRIS-GRIMS is present only in the local database, not publicised in central) |
| 28. Remarks | Remarks | "NOTES" attribute: a.aval where a.AID=103 and a.gid=g.gid |

