Junhyong Kim
Stephen A Fisher
Ontology for RNA sequencing (ORNASEQ)
The following ontology is provided by Stephen A Fisher and Junhyong Kim to annotate next-generation sequencing experiments performed on RNA.
English language definitions of what NCI means by the concept. These are limited to 1024 characters. They may also include information about the definition's source and attribution in a form that can easily be interpreted by software.
The official OBI definition, explaining the meaning of a class or property. Shall be Aristotelian, formalized and normalized. Can be augmented with colloquial definitions.
The official definition, explaining the meaning of a class or property. Shall be Aristotelian, formalized and normalized. Can be augmented with colloquial definitions.
2012-04-05:
Barry Smith
The official OBI definition, explaining the meaning of a class or property: 'Shall be Aristotelian, formalized and normalized. Can be augmented with colloquial definitions' is terrible.
Can you fix to something like:
A statement of necessary and sufficient conditions explaining the meaning of an expression referring to a class or property.
Alan Ruttenberg
Your proposed definition is a reasonable candidate, except that it is very common that necessary and sufficient conditions are not given. Mostly they are necessary, occasionally they are necessary and sufficient or just sufficient. Often they use terms that are not themselves defined and so they effectively can't be evaluated by those criteria.
On the specifics of the proposed definition:
We don't have definitions of 'meaning' or 'expression' or 'property'. For 'reference' in the intended sense I think we use the term 'denotation'. For 'expression', I think we you mean symbol, or identifier. For 'meaning' it differs for class and property. For class we want documentation that let's the intended reader determine whether an entity is instance of the class, or not. For property we want documentation that let's the intended reader determine, given a pair of potential relata, whether the assertion that the relation holds is true. The 'intended reader' part suggests that we also specify who, we expect, would be able to understand the definition, and also generalizes over human and computer reader to include textual and logical definition.
Personally, I am more comfortable weakening definition to documentation, with instructions as to what is desirable.
We also have the outstanding issue of how to aim different definitions to different audiences. A clinical audience reading chebi wants a different sort of definition documentation/definition from a chemistry trained audience, and similarly there is a need for a definition that is adequate for an ontologist to work with.
PERSON:Daniel Schober
GROUP:OBI:<http://purl.obolibrary.org/obo/obi>
definition
definition
textual definition
An administrative note intended for its editor. It may not be included in the publication version of the ontology, so it should contain nothing necessary for end users to understand the ontology.
PERSON:Daniel Schober
GROUP:OBI:<http://purl.obfoundry.org/obo/obi>
editor note
editor note
Name of editor entering the term in the file. The term editor is a point of contact for information regarding the term. The term editor may be, but is not always, the author of the definition, which may have been worked upon by several people
20110707, MC: label update to term editor and definition modified accordingly. See http://code.google.com/p/information-artifact-ontology/issues/detail?id=115.
20110707, MC: label update to term editor and definition modified accordingly. See https://github.com/information-artifact-ontology/IAO/issues/115.
PERSON:Daniel Schober
GROUP:OBI:<http://purl.obolibrary.org/obo/obi>
definition editor
definition editor
term editor
term editor
An alternative name for a class or property which means the same thing as the preferred name (semantically equivalent)
PERSON:Daniel Schober
GROUP:OBI:<http://purl.obolibrary.org/obo/obi>
alternative term
formal citation, e.g. identifier in external database to indicate / attribute source(s) for the definition. Free text indicate / attribute source(s) for the definition. EXAMPLE: Author Name, URI, MeSH Term C04, PUBMED ID, Wiki uri on 31.01.2007
PERSON:Daniel Schober
Discussion on obo-discuss mailing-list, see http://bit.ly/hgm99w
GROUP:OBI:<http://purl.obolibrary.org/obo/obi>
definition source
definition source
imported from
label
a core relation that holds between a part and its whole
part of
A relation between a planned process and a continuant participating in that process. The presence of the continuant at the end of the process is explicitly specified in the objective specification which the process realizes the concretization of.
is_specified_output_of
a relation between an independent continuant (the bearer) and a specifically dependent continuant (the dependent), in which the dependent specifically depends on the bearer for its existence
bearer of
a relation between a continuant and a process, in which the continuant is somehow involved in the process
participates in
a relation between an independent continuant (the bearer) and a role, in which the role specifically depends on the bearer for its existence
has role
x overlaps y if and only if there exists some z such that x has part z and z part of y
overlaps
inverse of starts with
starts
A mereological relationship or a topological relationship
mereotopologically related to
BFO 2 Reference: In all areas of empirical inquiry we encounter general terms of two sorts. First are general terms which refer to universals or types:animaltuberculosissurgical procedurediseaseSecond, are general terms used to refer to groups of entities which instantiate a given universal but do not correspond to the extension of any subuniversal of that universal because there is nothing intrinsic to the entities in question by virtue of which they – and only they – are counted as belonging to the given group. Examples are: animal purchased by the Emperortuberculosis diagnosed on a Wednesdaysurgical procedure performed on a patient from Stockholmperson identified as candidate for clinical trial #2056-555person who is signatory of Form 656-PPVpainting by Leonardo da VinciSuch terms, which represent what are called ‘specializations’ in [81
Entity doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. For example Werner Ceusters 'portions of reality' include 4 sorts, entities (as BFO construes them), universals, configurations, and relations. It is an open question as to whether entities as construed in BFO will at some point also include these other portions of reality. See, for example, 'How to track absolutely everything' at http://www.referent-tracking.com/_RTU/papers/CeustersICbookRevised.pdf
entity
BFO 2 Reference: Continuant entities are entities which can be sliced to yield parts only along the spatial dimension, yielding for example the parts of your table which we call its legs, its top, its nails. ‘My desk stretches from the window to the door. It has spatial parts, and can be sliced (in space) in two. With respect to time, however, a thing is a continuant.’ [60, p. 240
Continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. For example, in an expansion involving bringing in some of Ceuster's other portions of reality, questions are raised as to whether universals are continuants
continuant
BFO 2 Reference: every occurrent that is not a temporal or spatiotemporal region is s-dependent on some independent continuant that is not a spatial region
BFO 2 Reference: s-dependence obtains between every process and its participants in the sense that, as a matter of necessity, this process could not have existed unless these or those participants existed also. A process may have a succession of participants at different phases of its unfolding. Thus there may be different players on the field at different times during the course of a football game; but the process which is the entire game s-depends_on all of these players nonetheless. Some temporal parts of this process will s-depend_on on only some of the players.
Occurrent doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. An example would be the sum of a process and the process boundary of another process.
Simons uses different terminology for relations of occurrents to regions: Denote the spatio-temporal location of a given occurrent e by 'spn[e]' and call this region its span. We may say an occurrent is at its span, in any larger region, and covers any smaller region. Now suppose we have fixed a frame of reference so that we can speak not merely of spatio-temporal but also of spatial regions (places) and temporal regions (times). The spread of an occurrent, (relative to a frame of reference) is the space it exactly occupies, and its spell is likewise the time it exactly occupies. We write 'spr[e]' and `spl[e]' respectively for the spread and spell of e, omitting mention of the frame.
occurrent
b is an independent continuant = Def. b is a continuant which is such that there is no c and no t such that b s-depends_on c at t. (axiom label in BFO2 Reference: [017-002])
independent continuant
p is a process = Def. p is an occurrent that has temporal proper parts and for some time t, p s-depends_on some material entity at t. (axiom label in BFO2 Reference: [083-003])
BFO 2 Reference: The realm of occurrents is less pervasively marked by the presence of natural units than is the case in the realm of independent continuants. Thus there is here no counterpart of ‘object’. In BFO 1.0 ‘process’ served as such a counterpart. In BFO 2.0 ‘process’ is, rather, the occurrent counterpart of ‘material entity’. Those natural – as contrasted with engineered, which here means: deliberately executed – units which do exist in the realm of occurrents are typically either parasitic on the existence of natural units on the continuant side, or they are fiat in nature. Thus we can count lives; we can count football games; we can count chemical reactions performed in experiments or in chemical manufacturing. We cannot count the processes taking place, for instance, in an episode of insect mating behavior.Even where natural units are identifiable, for example cycles in a cyclical process such as the beating of a heart or an organism’s sleep/wake cycle, the processes in question form a sequence with no discontinuities (temporal gaps) of the sort that we find for instance where billiard balls or zebrafish or planets are separated by clear spatial gaps. Lives of organisms are process units, but they too unfold in a continuous series from other, prior processes such as fertilization, and they unfold in turn in continuous series of post-life processes such as post-mortem decay. Clear examples of boundaries of processes are almost always of the fiat sort (midnight, a time of death as declared in an operating theater or on a death certificate, the initiation of a state of war)
process
realizable entity
b is a specifically dependent continuant = Def. b is a continuant & there is some independent continuant c which is not a spatial region and which is such that b s-depends_on c at every time t during the course of b’s existence. (axiom label in BFO2 Reference: [050-003])
Specifically dependent continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. We're not sure what else will develop here, but for example there are questions such as what are promises, obligation, etc.
specifically dependent continuant
BFO 2 Reference: One major family of examples of non-rigid universals involves roles, and ontologies developed for corresponding administrative purposes may consist entirely of representatives of entities of this sort. Thus ‘professor’, defined as follows,b instance_of professor at t =Def. there is some c, c instance_of professor role & c inheres_in b at t.denotes a non-rigid universal and so also do ‘nurse’, ‘student’, ‘colonel’, ‘taxpayer’, and so forth. (These terms are all, in the jargon of philosophy, phase sortals.) By using role terms in definitions, we can create a BFO conformant treatment of such entities drawing on the fact that, while an instance of professor may be simultaneously an instance of trade union member, no instance of the type professor role is also (at any time) an instance of the type trade union member role (any more than any instance of the type color is at any time an instance of the type length).If an ontology of employment positions should be defined in terms of roles following the above pattern, this enables the ontology to do justice to the fact that individuals instantiate the corresponding universals – professor, sergeant, nurse – only during certain phases in their lives.
role
b is a generically dependent continuant = Def. b is a continuant that g-depends_on one or more other entities. (axiom label in BFO2 Reference: [074-001])
generically dependent continuant
BFO 2 Reference: Material entities (continuants) can preserve their identity even while gaining and losing material parts. Continuants are contrasted with occurrents, which unfold themselves in successive temporal parts or phases [60
BFO 2 Reference: Object, Fiat Object Part and Object Aggregate are not intended to be exhaustive of Material Entity. Users are invited to propose new subcategories of Material Entity.
BFO 2 Reference: ‘Matter’ is intended to encompass both mass and energy (we will address the ontological treatment of portions of energy in a later version of BFO). A portion of matter is anything that includes elementary particles among its proper or improper parts: quarks and leptons, including electrons, as the smallest particles thus far discovered; baryons (including protons and neutrons) at a higher level of granularity; atoms and molecules at still higher levels, forming the cells, organs, organisms and other material entities studied by biologists, the portions of rock studied by geologists, the fossils studied by paleontologists, and so on.Material entities are three-dimensional entities (entities extended in three spatial dimensions), as contrasted with the processes in which they participate, which are four-dimensional entities (entities extended also along the dimension of time).According to the FMA, material entities may have immaterial entities as parts – including the entities identified below as sites; for example the interior (or ‘lumen’) of your small intestine is a part of your body. BFO 2.0 embodies a decision to follow the FMA here.
material entity
High molecular weight, linear polymers, composed of nucleotides containing deoxyribose and linked by phosphodiester bonds; DNA contain the genetic information of organisms.
deoxyribonucleic acid
Any constitutionally or isotopically distinct atom, molecule, ion, ion pair, radical, radical ion, complex, conformer etc., identifiable as a separately distinguishable entity.
molecular entity
A material entity of anatomical origin (part of or deriving from an organism) that has as its parts a maximally connected cell compartment surrounded by a plasma membrane.
cell
A class of large neuroglial (macroglial) cells in the central nervous system - the largest and most numerous neuroglial cells in the brain and spinal cord. Astrocytes (from 'star' cells) are irregularly shaped with many long processes, including those with 'end feet' which form the glial (limiting) membrane and directly and indirectly contribute to the blood-brain barrier. They regulate the extracellular ionic and chemical environment, and 'reactive astrocytes' (along with microglia) respond to injury.
astrocyte
A cell from the thermogenic form of adipose tissue found in many species, particularly in newborns and hibernating mammals, but also in lesser amounts in adults of other mammals including humans. Brown fat is capable of rapid liberation of energy and seems to be important in the maintenance of body temperature immediately after birth and upon waking from hibernation.
brown fat cell
The basic cellular unit of nervous tissue. Each neuron consists of a body, an axon, and dendrites. Their purpose is to receive, conduct, and transmit impulses in the nervous system.
neuron
Cardiac muscle cells are striated muscle cells that are responsible for heart contraction. In mammals, the contractile fiber resembles those of skeletal muscle but are only one third as large in diameter, are richer in sarcoplasm, and contain centrally located instead of peripheral nuclei.
cardiac muscle cell
Any neuron that is part of a central nervous system.
central nervous system neuron
date the sequencing run was performed
GROUP: IRIDA Ontology (Emma)
sequencing run date
Pre-filled, ready-to-use reagent cartridges. Used to produce improved chemistry, cluster density and read length as well as improve quality (Q) scores. Reagent components are encoded to interact with the sequencing system to validate compatibility with user-defined applications.
GROUP: IRIDA Ontology (Emma)
sequencing kit
Packaged kits (containing adapters, indexes, enzymes, buffers etc), tailored for specific sequencing workflows, which allow the simplified preparation of sequencing-ready libraries for small genomes, amplicons, and plasmids.
GROUP: IRIDA Ontology (Emma) and http://applications.illumina.com/applications/sequencing/ngs-library-prep.html
library preparation kit
organism datum
A location, relative to cellular compartments and structures, occupied by a macromolecular machine when it carries out a molecular function. There are two ways in which the gene ontology describes locations of gene products: (1) relative to cellular structures (e.g., cytoplasmic side of plasma membrane) or compartments (e.g., mitochondrion), and (2) the stable macromolecular complexes of which they are parts (e.g., the ribosome).
cellular_component
A semiautonomous, self replicating organelle that occurs in varying numbers, shapes, and sizes in the cytoplasm of virtually all eukaryotic cells. It is notably the site of tissue respiration.
mitochondrion
A neuron projection that has a short, tapering, morphology. Dendrites receive and integrate signals from other neurons or from sensory stimuli, and conduct nerve impulses towards the axon or the cell body. In most neurons, the impulse is conveyed from dendrites to axon via the cell body, but in some types of unipolar neuron, the impulse does not travel via the cell body.
dendrite
The portion of a cell bearing surface projections such as axons, dendrites, cilia, or flagella that includes the nucleus, but excludes all cell projections.
cell body
Any constituent part of a cell, the basic structural and functional unit of all organisms.
cell part
Software is a plan specification composed of a series of instructions that can be
interpreted by or directly executed by a processing unit.
see sourceforge tracker discussion at http://sourceforge.net/tracker/index.php?func=detail&aid=1958818&group_id=177891&atid=886178
PERSON: Alan Ruttenberg
PERSON: Bjoern Peters
PERSON: Chris Stoeckert
PERSON: Melanie Courtot
GROUP: OBI
software
A model number is an information content entity specifically borne by catalogs, design specifications, advertising materials, inventory systems and similar that is about manufactured objects of the same class. The model number is an alternative term for the class. The manufactered objects may or may not also bear the model number. Model numbers can be encoded in a variety of other information objects, such as bar codes, numerals, or patterns of dots.
manufactered items may have more than one model number, sometimes by rebranding, or because companies are sold and the products issued new model numbers
Person: Alan Ruttenberg
model number
a data item is an information content entity that is intended to be a truthful statement about something (modulo, e.g., measurement precision or other systematic errors) and is constructed/acquired by a method which reliably tends to produce (approximately) truthful statements.
2/2/2009 Alan and Bjoern discussing FACS run output data. This is a data item because it is about the cell population. Each element records an event and is typically further composed a set of measurment data items that record the fluorescent intensity stimulated by one of the lasers.
2009-03-16: data item deliberatly ambiguous: we merged data set and datum to be one entity, not knowing how to define singular versus plural. So data item is more general than datum.
2009-03-16: removed datum as alternative term as datum specifically refers to singular form, and is thus not an exact synonym.
2014-03-31: See discussion at http://odontomachus.wordpress.com/2014/03/30/aboutness-objects-propositions/
JAR: datum -- well, this will be very tricky to define, but maybe some
information-like stuff that might be put into a computer and that is
meant, by someone, to denote and/or to be interpreted by some
process... I would include lists, tables, sentences... I think I might
defer to Barry, or to Brian Cantwell Smith
JAR: A data item is an approximately justified approximately true approximate belief
PERSON: Alan Ruttenberg
PERSON: Chris Stoeckert
PERSON: Jonathan Rees
data
data item
An information content entity that is a mark(s) or character(s) used as a conventional representation of another entity.
20091104, MC: this needs work and will most probably change
2014-03-31: We would like to have a deeper analysis of 'mark' and 'sign' in the future (see https://github.com/information-artifact-ontology/IAO/issues/154).
PERSON: James A. Overton
PERSON: Jonathan Rees
based on Oxford English Dictionary
symbol
A generically dependent continuant that is about some thing.
2014-03-10: The use of "thing" is intended to be general enough to include universals and configurations (see https://groups.google.com/d/msg/information-ontology/GBxvYZCk1oc/-L6B5fSBBTQJ).
information_content_entity 'is_encoded_in' some digital_entity in obi before split (040907). information_content_entity 'is_encoded_in' some physical_document in obi before split (040907).
Previous. An information content entity is a non-realizable information entity that 'is encoded in' some digital or physical entity.
PERSON: Chris Stoeckert
OBI_0000142
information content entity
a scalar measurement datum is a measurement datum that is composed of two parts, numerals and a unit label.
2009-03-16: we decided to keep datum singular in scalar measurement datum, as in
this case we explicitly refer to the singular form
Would write this as: has_part some 'measurement unit label' and has_part some numeral and has_part exactly 2, except for the fact that this won't let us take advantage of OWL reasoning over the numbers. Instead use has measurment value property to represent the same. Use has measurement unit label (subproperty of has_part) so we can easily say that there is only one of them.
PERSON: Alan Ruttenberg
PERSON: Melanie Courtot
scalar measurement datum
An information content entity whose concretizations indicate to their bearer how to realize them in a process.
2009-03-16: provenance: a term realizable information entity was proposed for OBI (OBI_0000337) , edited by the PlanAndPlannedProcess branch. Original definition was "is the specification of a process that can be concretized and realized by an actor" with alternative term "instruction".It has been subsequently moved to IAO where the objective for which the original term was defined was satisfied with the definitionof this, different, term.
2013-05-30 Alan Ruttenberg: What differentiates a directive information entity from an information concretization is that it can have concretizations that are either qualities or realizable entities. The concretizations that are realizable entities are created when an individual chooses to take up the direction, i.e. has the intention to (try to) realize it.
8/6/2009 Alan Ruttenberg: Changed label from "information entity about a realizable" after discussions at ICBO
Werner pushed back on calling it realizable information entity as it isn't realizable. However this name isn't right either. An example would be a recipe. The realizable entity would be a plan, but the information entity isn't about the plan, it, once concretized, *is* the plan. -Alan
PERSON: Alan Ruttenberg
PERSON: Bjoern Peters
directive information entity
A plan specification which describes the inputs and output of mathematical functions as well as workflow of execution for achieving an predefined objective. Algorithms are realized usually by means of implementation as computer programs for execution by automata.
Philippe Rocca-Serra
PlanAndPlannedProcess Branch
OBI_0000270
adapted from discussion on OBI list (Matthew Pocock, Christian Cocos, Alan Ruttenberg)
algorithm
A data format specification is the information content borne by the document published defining the specification.
Example: The ISO document specifying what encompasses an XML document; The instructions in a XSD file
2009-03-16: provenance: term imported from OBI_0000187, which had original definition "A data format specification is a plan which organizes
information. Example: The ISO document specifying what encompasses an
XML document; The instructions in a XSD file"
PERSON: Alan Ruttenberg
PlanAndPlannedProcess Branch
OBI branch derived
OBI_0000187
data format specification
A directive information entity with action specifications and objective specifications as parts that, when concretized, is realized in a process in which the bearer tries to achieve the objectives by taking the actions specified.
2009-03-16: provenance: a term a plan was proposed for OBI (OBI_0000344) , edited by the PlanAndPlannedProcess branch. Original definition was " a plan is a specification of a process that is realized by an actor to achieve the objective specified as part of the plan". It has been subsequently moved to IAO where the objective for which the original term was defined was satisfied with the definitionof this, different, term.
2014-03-31: A plan specification can have other parts, such as conditional specifications.
Alternative previous definition: a plan is a set of instructions that specify how an objective should be achieved
Alan Ruttenberg
OBI Plan and Planned Process branch
OBI_0000344
plan specification
A measurement datum is an information content entity that is a recording of the output of a measurement such as produced by a device.
2/2/2009 is_specified_output of some assay?
person:Chris Stoeckert
OBI_0000305
group:OBI
measurement datum
A scalar measurement datum that is the result of measuring a temporal interval
2009/09/28 Alan Ruttenberg. Fucoidan-use-case
Person:Alan Ruttenberg
time measurement datum
A symbol that is part of a CRID and that is sufficient to look up a record from the CRID's registry.
PERSON: Alan Ruttenberg
PERSON: Bill Hogan
PERSON: Bjoern Peters
PERSON: Melanie Courtot
CRID symbol
Original proposal from Bjoern, discussions at IAO calls
centrally registered identifier symbol
A software application is software that can be directly executed by some processing unit.
PERSON: Melanie Courtot
PERSON: Michel Dumontier
https://github.com/information-artifact-ontology/IAO/issues/80
software application
Homo sapiens
A physical object, or electronic counterpart, that is characterized by containing writing which is meant to be human-readable.
Document
An organizational header for concepts representing mostly abstract entities.
Conceptual Entity
Anything (e.g., a document) providing permanent evidence of or information about past events.
Record
One or more characters used to identify, name, or characterize the nature, properties, or contents of a thing.
Identifier
A control number unique to an object. It is used to identify the object among other objects in a collection.
Accession Number
A set of characters used as a code that is unique in the context or the system for which it is created. It serves as a means of identification and reference (often instead of a name) for an entity, person, thing, function, procedure, activity, variable, or body of data.
Unique Identifier
A processual entity that realizes a plan which is the concretization of a plan specification.
'Plan' includes a future direction sense. That can be problematic if plans are changed during their execution. There are however implicit contingencies for protocols that an agent has in his mind that can be considered part of the plan, even if the agent didn't have them in mind before. Therefore, a planned process can diverge from what the agent would have said the plan was before executing it, by adjusting to problems encountered during execution (e.g. choosing another reagent with equivalent properties, if the originally planned one has run out.)
We are only considering successfully completed planned processes. A plan may be modified, and details added during execution. For a given planned process, the associated realized plan specification is the one encompassing all changes made during execution. This means that all processes in which an agent acts towards achieving some
objectives is a planned process.
Bjoern Peters
branch derived
planned process
a role realized through the process of supplying materials such as animal subjects, reagents or other materials used in an investigation.
Supplier role is a special kind of service, e.g. biobank
PERSON:Jennifer Fostel
material provider role
supplier
material supplier role
a role inhering in a material entity that is realized when characteristics or responses elicited by the substance are used for comparison or reference.
Person:Jennifer Fostel
reference substance
OBI
reference substance role
Is a material entity that is created or changed during material processing.
PERSON: Alan Ruttenberg
processed material
A planned process with the objective to produce information about the material entity that is the evaluant, by physically examining it or its proxies.
12/3/12: BP: the reference to the 'physical examination' is included to point out that a prediction is not an assay, as that does not require physical examiniation.
PlanAndPlannedProcess Branch
measuring
scientific observation
OBI branch derived
assay
a role which inheres in a molecular entity and is realized by the process of recording or registering a stimulus.
19feb2009. not clear we need this term. originally if came from microarrays -- the probes on the array are termed detectors in some instances
One that detects, especially a mechanical, electrical, or chemical device that automatically identifies and records or registers a stimulus, such as an environmental change in pressure or temperature, an electric signal, or radiation from a radioactive material. http://www.answers.com/topic/detector 19feb2009
detector reagent role
A planned process which results in physical changes in a specified input material
PERSON: Bjoern Peters
PERSON: Frank Gibson
PERSON: Jennifer Fostel
PERSON: Melanie Courtot
PERSON: Philippe Rocca Serra
material transformation
OBI branch derived
material processing
A role that is realized through the execution of a study design in which the bearer of the role participates and in which data about that bearer is collected.
A participant can realize both "specimen role" and "participant under investigation role" at the same time. However "participant under investigation role" is distinct from "specimen role", since a specimen could somehow be involved in an investigation without being the thing that is under investigation.
GROUP: Role Branch
OBI
participant under investigation role
a study personnel role played by a party who is accountable for the execution of a study component and can make decisions about the conduct of the study
Person: Jennifer Fostel
responsible party
OBI
responsible party role
a responsible party role played by a person responsible for the overall conduct of a study
Person: Jennifer Fostel
principal investigator
principal investigator role
a personnel role played by a party who executes a component of the study plan; this can occur before, during, after or outside the study timeline
"executes the study plan" includes the suppliers and manufacturers of reagents and other materials used in the study
Person:Jennifer Fostel
worker
OBI
worker role
a population is a collection of individuals from the same taxonomic class living, counted or sampled at a particular site or in a particular area
1/28/2013, BP, on the call it was raised that we may want to switch to an external ontology for all populatin terms:
http://code.google.com/p/popcomm-ontology/
PERSON: Philippe Rocca-Serra
adapted from Oxford English Dictionnary
population
a reference participant role realized by equivalent treatment of participants
Person:Jennifer Fostel
biological replicate
OBI
biological replicate role
A role borne by an entity and that is realized in a process that is part of an investigation in which an objective is achieved. These processes include, among others: planning, overseeing, funding, reviewing.
Implementing a study means carrying out or performing the study and providing reagents or other materials used in the study and other tasks without which the study would not happen.
Philly2013: Historically, this role would have been borne only by humans or organizations. However, we now also want to enable representing investigations run by robot scientists such as ADAM (King et al, Science, 2009)
GROUP: Role Branch
investigator
OBI
investigation agent role
a reference subject role which inheres in an organism or entity of organismal origin so that the characteristics or responses of the participant playing the reference participant role are used for comparison or reference
Jennifer Fostel
reference participant
OBI
reference subject role
An entity that can bear roles, has members, and has a set of organization rules. Members of organizations are either organizations themselves or individual people. Members can bear specific organization member roles that are determined in the organization rules. The organization rules also determine how decisions are made on behalf of the organization by the organization members.
BP: The definition summarizes long email discussions on the OBI developer, roles, biomaterial and denrie branches. It leaves open if an organization is a material entity or a dependent continuant, as no consensus was reached on that. The current placement as material is therefore temporary, in order to move forward with development. Here is the entire email summary, on which the definition is based:
1) there are organization_member_roles (president, treasurer, branch
editor), with individual persons as bearers
2) there are organization_roles (employer, owner, vendor, patent holder)
3) an organization has a charter / rules / bylaws, which specify what roles
there are, how they should be realized, and how to modify the
charter/rules/bylaws themselves.
It is debatable what the organization itself is (some kind of dependent
continuant or an aggregate of people). This also determines who/what the
bearer of organization_roles' are. My personal favorite is still to define
organization as a kind of 'legal entity', but thinking it through leads to
all kinds of questions that are clearly outside the scope of OBI.
Interestingly enough, it does not seem to matter much where we place
organization itself, as long as we can subclass it (University, Corporation,
Government Agency, Hospital), instantiate it (Affymetrix, NCBI, NIH, ISO,
W3C, University of Oklahoma), and have it play roles.
This leads to my proposal: We define organization through the statements 1 -
3 above, but without an 'is a' statement for now. We can leave it in its
current place in the is_a hierarchy (material entity) or move it up to
'continuant'. We leave further clarifications to BFO, and close this issue
for now.
PERSON: Alan Ruttenberg
PERSON: Bjoern Peters
PERSON: Philippe Rocca-Serra
PERSON: Susanna Sansone
GROUP: OBI
organization
technical replicate role is realized when two portions from one evaluant are used in replicate runs of an assay
Person: Jennifer Fostel
technical replicate
technical replicate role
A plan specification which has sufficient level of detail and quantitative information to communicate it between investigation agents, so that different investigation agents will reliably be able to independently reproduce the process.
PlanAndPlannedProcess Branch
OBI branch derived + wikipedia (http://en.wikipedia.org/wiki/Protocol_%28natural_sciences%29)
protocol
PCR is the process in which a DNA polymerase is used to amplify a piece of DNA by in vitro enzymatic replication. As PCR progresses, the DNA thus generated is itself used as a template for replication. This sets in motion a chain reaction in which the DNA template is exponentially amplified.
OBI Plan
PCR
adapted from wikipedai
polymerase chain reaction
DNA sequencing is a sequencing process which uses deoxyribonucleic acid as input and results in a the creation of DNA sequence information artifact using a DNA sequencer instrument.
Philippe Rocca-Serra
OBI Branch derived
DNA sequencing
A planned process with the objective of collecting a specimen.
Note: definition is in specimen creation objective which is defined as an objective to obtain and store a material entity for potential use as an input during an investigation.
Philly2013: A specimen collection can have as part a material entity acquisition, such as ordering from a bank. The distinction is that specimen collection necessarily involves the creation of a specimen role. However ordering cell lines cells from ATCC for use in an investigation is NOT a specimen collection, because the cell lines already have a specimen role.
Philly2013: The specimen_role for the specimen is created during the specimen collection process.
label changed to 'specimen collection process' on 10/27/2014, details see tracker:
http://sourceforge.net/p/obi/obi-terms/716/
Bjoern Peters
specimen collection
specimen collection process
is a process which results in the creation of a library from fragments of DNA using cloning vectors or oligonucleotides with the role of adaptors.
Philippe Rocca-Serra
library construction
library preparation
is a collection of short paired tags from the two ends of DNA fragments are extracted and covalently linked as ditag constructs
Philippe Rocca-Serra
mate-paired library
paired-end tag (PET) library
adapted from information provided by Solid web site
paired-end library
A device in which a measure function inheres.
GROUP:OBI Philly workshop
OBI
measurement device
A person or organization that has a manufacturer role
manufacturer
A device with a separation function realized in a planed process
material separation device
is a role which inheres in a person or organization and is realized in in a planned process which provides access to training, materials or execution of protocols for an organization or person
PERSON:Helen Parkinson
service provider role
A material entity that is designed to perform a function in a scientific investigation, but is not a reagent.
2012-12-17 JAO: In common lab usage, there is a distinction made between devices and reagents that is difficult to model. Therefore we have chosen to specifically exclude reagents from the definition of "device", and are enumerating the types of roles that a reagent can perform.
2013-6-5 MHB: The following clarifications are outcomes of the May 2013 Philly Workshop. Reagents are distinguished from devices that also participate in scientific techniques by the fact that reagents are chemical or biological in nature and necessarily participate in some chemical interaction or reaction during the realization of their experimental role. By contrast, devices do not participate in such chemical reactions/interactions. Note that there are cases where devices use reagent components during their operation, where the reagent-device distinction is less clear. For example:
(1) An HPLC machine is considered a device, but has a column that holds a stationary phase resin as an operational component. This resin qualifies as a device if it participates purely in size exclusion, but bears a reagent role that is realized in the running of a column if it interacts electrostatically or chemically with the evaluant. The container the resin is in (“the column”) considered alone is a device. So the entire column as well as the entire HPLC machine are devices that have a reagent as an operating part.
(2) A pH meter is a device, but its electrode component bears a reagent role in virtue of its interacting directly with the evaluant in execution of an assay.
(3) A gel running box is a device that has a metallic lead as a component that participates in a chemical reaction with the running buffer when a charge is passed through it. This metallic lead is considered to have a reagent role as a component of this device realized in the running of a gel.
In the examples above, a reagent is an operational component of a device, but the device itself does not realize a reagent role (as bearing a reagent role is not transitive across the part_of relation). In this way, the asserted disjointness between a reagent and device holds, as both roles are never realized in the same bearer during execution of an assay.
PERSON: Helen Parkinson
instrument
OBI development call 2012-12-17.
device
A measurement datum that representing the primary structure of a macromolecule(it's sequence) sometimes associated with an indicator of confidence of that measurement.
Person:Chris Stoeckert
GROUP: OBI
sequence data
a reference substance role that is borne by a material entity with a known amount which is mixed into the evaluant of assays for quality control or data normalization purposes
PERSON: Chris Stoeckert, Jie Zheng, Bjoern Peter
MO_937 spike_quality_control
spike-in quality control role
A time measurement datum that is the result of measurement of age of an organism
note that we are currently defining subtypes of age measurement datum that specify when the age is relative to, e.g. planting, as we don't have adequate temporal predicates yet.
life of bearer doesn't imply organism
this assay measures time not developmental stage. we recognize that development can take different time periods under different conditions such as media / temperature
age as a quality is dubious; we plan to revisit
stages in development are currently handled with controlled vocabulary, such as 2-somite stage
PERSON: Alan Ruttenberg, Chris Stoeckert, Jie Zheng
MO_178 Age
age measurement datum
An age measurement datum that is the result of the measurement of the age of an organism since birth, the process of emergence and separation of offspring from the mother.
PERSON:Chris Stoeckert, Jie Zheng
MO_710 birth
age since birth measurement datum
An organism that is bred to have some uniform behavioral, morphological, physiological, or genetic characteristics with similarly bred organisms
Bjoern Peters, Helen Parkinson, Philippe Rocca-Serra, Jie Zheng, Chris Stoeckert
cultivar
ecotype
strain
MO_9 StrainOrLine, MO_71 Ecotype, MO_124 Cultivar
selectively maintained organism
An assay in which sequencing technology (e.g. Solexa/454) is used to generate RNA sequence, analyse the transcibed regions of the genome, and or to quantitate transcript abundance
PERSON: James Malone
transcription profiling by high throughput sequencing
EFO_0002770 transcription profiling by high throughput sequencing
RNA-seq assay
A specimen primarily composed of a cell or cells collected from a multicellular organism or a cell culture.
Discussed on obi call Jan 23, 2017. To improve cell specimen that include single cell specimen. Details see tracker: https://sourceforge.net/p/obi/obi-terms/828/
PERSON: Chris Stoeckert, Jie Zheng, Alexander Diehl
MO_612 cell
cell specimen
An enzymatic amplification which amplifies nucleic acid sequence by making many copies off the same template.
PERSON: Chris Stoeckert, Jie Zheng
MO_997 linear_amplification
linear amplification
A CRID symbol denotes a specimen and used to distinguish one specimen from another in an investigation.
Person: Chris Stoeckert, Jie Zheng
specimen ID
NIAID GSCID-BRC metadata working group
specimen identifier
An assay that incorporates Paired-End Tags and sequencing technology to determine transcripts, gene structures, and gene expressions.
Person: Venkat Malladi, Chris Stoeckert, Jie Zheng
RNA-PET
RNA-PET assay
Ruan, et al. Genome wide full-length transcript analysis using 5' and 3' paired-end-tag next generation sequencing (RNA-PET). Methods Mol Biol. 2012;809:535-62. [PMID:22113299]
transcript analysis by paired-end tag sequencing
A library preparation that results in the creation of a library of the 5' and 3' ends of DNA or cDNA fragments using adaptors and endonucleases. The preparation may or may not include cloning process.
Person: Venkat Malladi, Jie Zheng
Venkat Malladi, Jie Zheng
paired-end library preparation
An organization that provides sequence determination service
PERSON: Chris Stoeckert, Jie Zheng
NIAID GSCID-BRC metadata working group
sequencing facility organization
A person who collects the specimen
PERSON: Chris Stoeckert, Jie Zheng
NIAID GSCID-BRC metadata working group
specimen collector
An information content entity that specifies a value within a classification scheme or on a quantitative scale.
This term is currently a descendant of 'information content entity', which requires that it 'is about' something. A value specification of '20g' for a measurement data item of the mass of a particular mouse 'is about' the mass of that mouse. However there are cases where a value specification is not clearly about any particular. In the future we may change 'value specification' to remove the 'is about' requirement.
PERSON:Bjoern Peters
value specification
a planned process which consists in running a set of samples as a pool in one single instrument run of data acquisition process while retaining the ability to associate individual results to each of the individual input samples thanks to the use of a multiplex identifier, introduced during the ligation step of the individual library preparation and specific to a given sample.
PERSON:Philippe Rocca-Serra
OBI
sequencing library multiplexing
A multiplexing sequence identifier is a nucleic acid sequence which is used in a ligation step of library preparation process to allow pooling of samples while maintaining ability to identify individual source material and creation of a multiplexed library
PERSON:Philippe Rocca-Serra
OBI
multiplexing sequence identifier
A cell specimen that contains only one cell.
Requested by Sirarat Sarntivijai (EBI). Details see tracker: https://sourceforge.net/p/obi/obi-terms/828/
PERSON: Jie Zheng, Alexander Diehl
PERSON: Jie Zheng, Alexander Diehl
single cell specimen
A data format specification that is a binary file containing base calling data. These files are created by sequencing machines during the act of sequencing, and contain data about each of the nucleotide clusters on a sequencing flow cell.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
BCL format
A processed material developed by the External RNA Controls Consortium (ERCC) that consists of 92 transcripts, derived and traceable from NIST-certified DNA plasmids.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
https://www.thermofisher.com/order/catalog/product/4456740
ERCC RNA spike-in
A processed material that consists of transcript isoforms of 7 spike-in RNA variant (SIRV) genes and possibly the 92 External RNA Controls Consortium (ERCC) RNA spike-in genes.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
https://www.lexogen.com/sirvs/design/
SIRV RNA spike-in
An accession number that applies to a database record for an individual sample object in an NCBI Sequence Read Archive (SRA) submission. These records are imported from the NCBI BioSample database.
Stephen Fisher, Dan Berrios
SRA sample accession number
SRS identifier
An age measurement datum that is the result of the measurement of the age of a cell since cultured (the process of seeding cells onto a culture dish).
Stephen A. Fisher, Junhyong Kim, Dan Berrios
age of culture
age since culture seeding measurement datum
An algorithm used to assign aligned sequence reads, resulting from a reference genome transcriptome alignment algorithm, to sequence features (e.g. genes or transcripts).
Stephen A. Fisher, Junhyong Kim, Dan Berrios
alignment counting algorithm
A software application that implements an alignment counting algorithm, used to count the overlap of aligned sequencing reads with genes.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
doi:10.1093/bioinformatics/btu638
alignment counting application
An algorithm that assigns nucleotides to temporal or spatial peaks generated by a detector, such as peaks of light intensities generated by a DNA sequencer
Stephen A. Fisher, Junhyong Kim, Dan Berrios
base calling algorithm
A base-calling application that implments a base-calling algorithm to convert BCL (binary base call) files, (e.g., those generated by Illumina sequencing systems) to standard FASTQ file formats for downstream analysis.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
base calling application
A base calling application that implments a base calling algorithm to convert BCL (binary base call) files generated by Illumina sequencing systems to standard FASTQ file formats for downstream analysis.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
https://support.illumina.com/sequencing/sequencing_software/bcl2fastq-conversion-software.html
bcl2fastq software application
A data item that is the date when a process was initiated.
Chris Stoeckert, Daniel Berrios, Stephen A. Fisher
date process started
A central registered identifier symbol that denotes a specific study in dbGaP.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
dbGaP accession number
doi:10.1038/ng1007-1181
dbGaP study identifier
A worker role played by the person who runs the enzymatic amplifying process.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
material amplification role
A worker role played by the person who performs the library preparation process to generate a sequencing library for a sample to be sequenced.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
material sequencing library preparation role
A data item that is the number of PCR cycles used during the construction of a sequencing library.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
library construction PCR cycles
https://en.wikipedia.org/wiki/Polymerase_chain_reaction
number of PCR cycles during library construction
A data item that is the number of times an amplification reaction happened.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
number of rounds of amplification
A reference substance role that is played by a reference genome when used during sequence alignment.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
reference genome role
An algorithm that attempts to align a nucleic acid sequence to a reference genome and transcriptome.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
reference genome-transcriptome alignment algorithm
A measurement datum that is the result of the measurement of the number of bases in a DNA or RNA sequence.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
read length, read length measurement datum
sequence read length measurement datum
A scalar measurement datum that indicates the amount of sequencing library used as input for a sequencer.
Stephen A. Fisher, Junhyong Kim
library input amount
sequencing library input quantity measurement datum
A cDNA library that is a collection of short tags from only one end of DNA fragments.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
single-end library
A scalar measurement datum that indicates the amount of specimen collected.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
harvest quantity
specimen harvest quantity
A data item that indicates the dilution of spike-in added to a specimen.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
spike-in dilution factor
A reference genome transcriptome alignment algorithm that is a standalone RNA-seq alignment algorithm that uses uncompressed suffix arrays and a mapping algorithm similar to those used in large-scale genome alignment tools to align RNA-seq reads to a genomic reference.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
https://www.ncbi.nlm.nih.gov/pubmed/23104886
star algorithm
A sequencing assay that incorporates single-end reads and sequencing technology to determine transcripts, gene structures, and gene expressions.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
https://www.illumina.com/science/technology/next-generation-sequencing/paired-end-vs-single-read-sequencing.html
transcript analysis by single-end sequencing
An alignment counting algorithm that assigns gene features to genomic alignments using a hierarchical assignment scheme, which allows simultaneous quantification of multiple feature types or annotation levels without repeatedly assigning reads.
Stephen A. Fisher, Junhyong Kim
http://kim.bio.upenn.edu/software/verse.shtml or http://biorxiv.org/content/early/2016/05/14/053306
verse algorithm
A material separation device also commerically known as Isoraft, that is used to isolate single cells.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
https://en.wikipedia.org/wiki/Microrafts
microraft
A device that is a laboratory tool commonly used in chemistry, biology and medicine to transport a measured volume of liquid, often as a media dispenser.
Stephen A. Fisher, Junhyong Kim, Dan Berrios
https://en.wikipedia.org/wiki/Pipette
pipette
A material entity that is an individual living system, such as animal, plant, bacteria or virus, that is capable of replicating or reproducing, growth and maintenance in the right environment. An organism may be unicellular or made up, like humans, of many billions of cells divided into specialized tissues and organs.
10/21/09: This is a placeholder term, that should ideally be imported from the NCBI taxonomy, but the high level hierarchy there does not suit our needs (includes plasmids and 'other organisms')
13-02-2009:
OBI doesn't take position as to when an organism starts or ends being an organism - e.g. sperm, foetus.
This issue is outside the scope of OBI.
GROUP: OBI Biomaterial Branch
WEB: http://en.wikipedia.org/wiki/Organism
organism
A material entity that has the specimen role.
Note: definition is in specimen creation objective which is defined as an objective to obtain and store a material entity for potential use as an input during an investigation.
PERSON: James Malone
PERSON: Philippe Rocca-Serra
GROUP: OBI Biomaterial Branch
specimen
A flow_cytometer_sorter is a flow_cytometer that analyzes and separates or sorts particles passing through (based on properties measured during analysis) to collect cells of interest.
John Quinn
Melanie Courtot
http://www.flocyte.com/FRTP/Resources/flow_cytometry_glossary.htm
flow cytometer sorter
A DNA sequencer is an instrument that determines the order of deoxynucleotides in deoxyribonucleic acid sequences.
Trish Whetzel
MO
DNA sequencer
the use of a chemical or biochemical means to infer the sequence of a biomaterial
has_output should be sequence of input; we don't have sequence well defined yet
PlanAndPlannedProcess Branch
OBI branch derived
sequencing assay
the use of enzymes to increase the number of molecules of a biomaterial
PERSON:Kevin Clancy
OBI branch derived
enzymatic amplification
An algorithm that attempts to align two molecular sequences (nucleic acid or protein sequences).
Ready for OBI-WS : v1.0
WS Annotations Group, UGA
WS Annotations Group, UGA
pairwise sequence alignment algorithm
BLAST (Basic Local Alignment Search Tool), a pairwise sequence alignment algorithm is a heuristic search algorithm for comparing primary biological sequence information, such as the amino-acid or nucleotides sequences with a database of sequences, and identify sequences that resemble the query sequence above a certain threshold.
Ready for OBI-WS : v1.0
ITPPR
Wikipedia: BLAST
BLAST algorithm
A data format specification for molecular sequence feature information.
Ready for OBI-WS : v1.0
ITPPR
sequence data format
EDAM
sequence data format specification
A sequence data that is about the primary structure of RNA.
ITPPR
ITPPR
RNA sequence data
A processual entity whose completion is hypothesized (by a healthcare provider) to alleviate the signs and symptoms associated with a disorder
Albert Goldfain
http://code.google.com/p/ogms/issues/detail?id=35
treatment
The principle investigtor or responsible person for a study or a sample collection.
study principle investigator or responsible
A data item that indicates the time when the collection of a specimen occurred.
Alice Nzinga
Mathias Brochhausen
date of specimen preparation
date of specimen collection
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
amplified date
A date process started item that indicates when a specimen was run through the amplification protocol prior to library preparation and sequencing.
date of specimen amplification
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
library preparation date
A date process started item that indicates when a specimen was run through the library preparation protocol prior to sequencing.
date of specimen library preparation
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
amplification protocol
A protocol document that contains detailed instructions on how to perform an enzymatic amplification planned process.
enzymatic amplification protocol
Exists in MS (Mass spectrometry ontology) with parent "quantification object attribute"
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
The name for identifying an experiment.
experiment name
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
https://www.illumina.com/products/by-type/sequencing-kits/cluster-gen-sequencing-reagents/hiseq-rapid-sbs-kit.html
A processed material that is a sequencing kit used with Illumina HiSeq 2500 and 1500 sequencers allowing for single-end or paired-end sequencing.
Illumina TruSeq Rapid SBS Kit v2
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
https://www.illumina.com/products/by-type/sequencing-kits/library-prep-kits/truseq-stranded-mrna.html
A processed material that is used to create a sequencing library that retains mRNA strand orientation.
Illumina TruSeq Stranded
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
A protocol document that contains detailed instructions on how to perform a library preparation planned process.
library preparation protocol
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
http://scap-t.org/?q=content/sample-attributes-data-dictionary
A data item that indicates the number of lanes in a flow cell that were used to sequence a specimen
number of flow cell lanes
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
http://scap-t.org/?q=content/sample-attributes-data-dictionary
A data item that indicates the number of flow cells used to sequence a specimen.
number of flow cells
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
A software application that implements a pairwise sequence alignment algorithm to attempt to align two molecular sequences (nucleic acid or protein sequences).
pairwise sequence alignment application
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
An area within a material anatomical entity that is distinct from the other areas of that entity such as the parietal cortex, hippocampus CA1 and amygdala.
region of material anatomical entity
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
A molecular entity that is an engineered molecule which enables the collection of RNA from cells.
RNA harvest compound
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
A material entity that contained the specimen during harvesting such as a primary culture, tissue, or fixed section.
specimen harvest source type
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
A material entity that represents the number of number of material entities used during harvesting; for example, a single cell, pooled single cells, tissue, a culture dish, or a coverslip.
specimen harvest source unit
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
https://github.com/alexdobin/STAR
A software application that implements a RNA-seq alignment algorithm that uses uncompressed suffix arrays and a mapping algorithm similar to those used in large-scale genome alignment tools to align RNA-seq reads to a genomic reference.
star
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
doi:10.1038/nmeth.2804
A molecular entity that is an engineered molecule which upon photoactivation enables mRNA capture from single cells in live tissue.
TIVA tag
PERSON: Junhyong Kim
PERSON: Stephen A. Fisher
http://kim.bio.upenn.edu/software/verse.shtml
An alignment counting application that assigns gene features to genomic alignments using a hierarchical assignment scheme, which allows simultaneous quantification of multiple feature types or annotation levels without repeatedly assigning reads.
verse
A collection of sequences (often chromosomes) taken as the standard for a given organism and genome assembly.
reference_genome
A level of depth of a taxon in a taxonomic hierarchy.
taxonomic_rank
species
Anatomical entity that has mass.
material anatomical entity
taxonomic rank
A role played by a a biological sample in the context of an experiment where the intent is that biological or technical variation is measured.
James Malone
replicate
FASTQ format is a text-based format for storing both a biological sequence (usually nucleotide sequence) and its corresponding quality scores.
FASTQ format
BAM is the compressed binary version of the Sequence Alignment/Map (SAM) format
BAM format
A genome is the full genetic content of an organism, contained in either DNA or RNA (such as for viruses).
James Malone
genome
The transcriptome is the set of all RNA molecules, including mRNA, rRNA, tRNA, and other non-coding RNA produced in one or a population of cells.
transcriptome
CEL data file format describes the format used in a CEL file for storing the results of the intensity calculations on the pixel values of a DAT file. This includes an intensity value, standard deviation of the intensity, the number of pixels used to calculate the intensity value, a flag to indicate an outlier as calculated by the algorithm and a user defined flag indicating the feature should be excluded from future analysis. The file stores the previously stated data for each feature on the probe array.
James Malone
CEL data file format
A BLAST algorithm that searches/aligns a nucleotide query sequence against a nucleotide database at a nucleotide level.
BLASTn