Testing new DocBook/HTML convertersThe Insectmobile (Insektmobilen) is a research project at the National History Museum of Denmark, University of Copenhagen, intending to investigate the diversity of flying insects in Denmark. In the summer of 2018, 2019 and 2020, almost 400 volunteers collected flying insects using large custom-made insect nets mounted on the roof of their cars. The bulk insect samples were processed with a non-destructive DNA extraction DNA metabarcoding protocol (dx.doi.org/10.17504/protocols.io.bmunk6ve), and sequences were assigned taxonomy by importing the fasta file into GBIF's sequence ID tool (https://www.gbif.org/tools/sequence-id). The sequences were queried against a 99% clustered version of the BOLD Public Database v2024-01-06 public data (COI-5P sequences).The dataset contains unidentified sequences and potential errors, and contaminants. For example, even though the primers used were developed as universal primers targeting freshwater insects, you will find other phyla, classes, etc. We share these sequences and associated data for the data to be as open as possible, butplease reannotate sequences and filter the data for your specific needs before using the data for analysis.Be aware that the samples may contain identification of organisms coming from the gut content of sampled insects and other environmental DNA sources. Sequence identification certainty is captured in the taxonAssignmentMethod field of the Identification table, and can be interpreted as follows:Thebit scoreis the required size of a sequence database in which the current match could be found just by chance. The bit score is a log2-scaled and normalised raw score. Each increase by one doubles the required database size (2-bit score).Theexpect valueis a parameter that describes the number of hits one can "expect" to see by chance when searching a database of a particular size. It decreases exponentially as the match score increases. Hence, a low expect value is better.identity: How much of the query(input) sequence aligns with the match in the reference database, in percent.matchType: Badges representing different identity thresholds (match types):Blast exact match: identity >= 99% and queryCoverage >= 80%. This is within the threshold of the OTU.Blast ambiguous match: identity >= 99% and queryCoverage >= 80%, but there is at least one more match with similar identity.Blast close match: identity < 99% but > 90% and queryCoverage >= 80%. It is something close to the OTU, maybe the same genus.Blast weak match: there is a match, but with identity < 90% or/and queryCoverage < 80%. Depending on the quality of the sequence, bit score, identity, and expect value, a higher taxon could be inferred from this.Blast no match: no match to the reference database
此資源出現紀錄的資料已發佈為達爾文核心集檔案(DwC-A),其以一或多組資料表構成分享生物多樣性資料的標準格式。
核心資料表包含 1 筆紀錄。
此 IPT 存放資料以提供資料儲存庫服務。資料與資源的詮釋資料可由「下載」單元下載。「版本」表格列出此資源的其它公開版本,以便利追蹤其隨時間的變更。
研究者應尊重以下權利聲明。:
此資料的發布者及權利單位為 Test Organization #1。
To the extent possible under law, the publisher has waived all rights to these data and has dedicated them to the Public Domain (CC0 1.0). Users may copy, modify, distribute and use the work, including for commercial purposes, without restriction.