Language resource management — Comprehensive Annotation Framework (ComAF) — Part 3: Diagrammatic semantic authoring (DSA)

This document specifies how to represent (not visualize) documents (instance data, not data schemas) as graphs. It does not specify how to visualize or operate on document data, but it aims at making documents easier for people to compose and comprehend by allowing for various graph-based flexible user interfaces, possibly incorporating document-visualization practices (see Introduction). In this connection, this document does not specify annotations to existing documents either, but rather it specifies a schema of documents with explicit logical structures.

Gestion des ressources linguistiques — Cadre global d’annotation (ComAF) — Partie 3: Création sémantique diagrammatique (DSA)

ISO/TC 37/SC 4 Date: 2021-02 ISO 24627-3:2021(F) ISO/TC 37/SC 4/GT 5 Secrétariat: KATS Gestion des ressources linguistiques — Cadre global d'annotation (ComAF) — Partie 3: Création sémantique diagrammatique (DSA) Language Resource management — Comprehensive annotation framework (ComAF) — Part 3: Diagrammatic semantic authoring (DSA) DOCUMENT PROTÉGÉ PAR COPYRIGHT Droits de reproduction réservés. Sauf indication contraire, aucune partie de cette publication ne peut être reproduite ni utilisée sous quelque forme que ce soit et par aucun procédé, électronique ou mécanique, y compris la photocopie, l'affichage sur l'internet ou sur un Intranet, sans autorisation écrite préalable. Les demandes d'autorisation peuvent être adressées à l'ISO à l'adresse ci-après ou au comité membre de l'ISO dans le pays du demandeur. ISO copyright office Ch. de Blandonnet 8 ? CP 401 CH-1214 Vernier, Geneva, Switzerland Tel. + 41 22 749 01 11 Fax + 41 22 749 09 47 copyright@iso.org www.iso.org Sommaire Page Avant-propos........................................................................................................................................ 4 Introduction.......................................................................................................................................... 5 1 Domaine d'application........................................................................................................... 1 2 Références normatives........................................................................................................... 1 3 Termes et définitions............................................................................................................... 1 4 Spécification............................................................................................................................. 2 Annexe A (informative) Exemples...................................................................................................... 4 Bibliographie...................................................................................................................................... 10 Avant-propos L'ISO (Organisation internationale de normalisation) est une fédération mondiale d'organismes nationaux de normalisation (comités membres de l'ISO). L'élaboration des Normes internationales est en général confiée aux comités techniques de l'ISO. Chaque comité membre intéressé par une étude a le droit de faire partie du comité technique créé à cet effet. Les organisations internationales, gouvernementales et non gouvernementales, en liaison avec l'ISO participent également aux travaux. L'ISO collabore étroitement avec la Commission électrotechnique internationale (IEC) en ce qui concerne la normalisation électrotechnique. Les procédures utilisées pour élaborer le présent document et celles destinées à sa mise à jour sont décrites dans les Directives ISO/IEC, Partie 1. Il convient, en particulier, de prendre note des différents critères d'approbation requis pour les différents types de documents ISO. Le présent document a été rédigé conformément aux règles de rédaction données dans les Directives ISO/IEC, Partie 2 (voir www.iso.org/directives). L'attention est attirée sur le fait que certains des éléments du présent document peuvent faire l'objet de droits de propriété intellectuelle ou de droits analogues. L'ISO ne saurait être tenue pour responsable de ne pas avoir identifié de tels droits de propriété et averti de leur existence. Les détails concernant les références aux droits de propriété intellectuelle ou autres droits analogues identifiés lors de l'élaboration du document sont indiqués dans l'Introduction et/ou dans la liste des déclarations de brevets reçues par l'ISO (voir www.iso.org/brevets). Les appellations commerciales éventuellement mentionnées dans le présent document sont données pour information, par souci de commodité, à l'intention des utilisateurs et ne sauraient constituer

Upravljanje jezikovnih virov - Ogrodje za celovito označevanje (ComAF) - 3. del: Diagramsko semantično avtorstvo (DSA)

General Information

Status
Published
Publication Date
07-Feb-2021
Current Stage
6060 - International Standard published
Start Date
08-Feb-2021
Due Date
16-Nov-2020
Completion Date
08-Feb-2021
Standard
ISO 24627-3:2021 - BARVE
English language
14 pages
sale 10% off
Preview
sale 10% off
Preview
e-Library read for
1 day

Standards Content (Sample)


SLOVENSKI STANDARD
01-junij-2021
Upravljanje jezikovnih virov - Ogrodje za celovito označevanje (ComAF) - 3. del:
Diagramsko semantično avtorstvo (DSA)
Language resource management -- Comprehensive Annotation Framework (ComAF) -
Part 3: Diagrammatic semantic authoring (DSA)
Gestion des ressources linguistiques -- Cadre global d’annotation (ComAF) - Partie 3:
Création sémantique diagrammatique (DSA)
Ta slovenski standard je istoveten z: ISO 24627-3:2021
ICS:
01.020 Terminologija (načela in Terminology (principles and
koordinacija) coordination)
01.140.20 Informacijske vede Information sciences
2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.

INTERNATIONAL ISO
STANDARD 24627-3
First edition
2021-02
Language resource management —
Comprehensive Annotation
Framework (ComAF) —
Part 3:
Diagrammatic semantic authoring
(DSA)
Gestion des ressources linguistiques — Cadre global d’annotation
(ComAF) —
Partie 3: Création sémantique diagrammatique (DSA)
Reference number
©
ISO 2021
© ISO 2021
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting
on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address
below or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
ii © ISO 2021 – All rights reserved

Contents Page
Foreword .iv
Introduction .v
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
4 Specification . 1
Annex A (informative) Examples . 3
Bibliography . 8
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards
bodies (ISO member bodies). The work of preparing International Standards is normally carried out
through ISO technical committees. Each member body interested in a subject for which a technical
committee has been established has the right to be represented on that committee. International
organizations, governmental and non-governmental, in liaison with ISO, also take part in the work.
ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of
electrotechnical standardization.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular the different approval criteria needed for the
different types of ISO documents should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www .iso .org/ directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. ISO shall not be held responsible for identifying any or all such patent rights. Details of
any patent rights identified during the development of the document will be in the Introduction and/or
on the ISO list of patent declarations received (see www .iso .org/ patents).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to the
World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT), see www .iso .org/
iso/ foreword .html.
This document was prepared by Technical Committee ISO/TC 37, Language and terminology,
Subcommittee SC 4, Language resource management.
A list of all parts in the ISO 24627 series can be found on the ISO website.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www .iso .org/ members .html.
iv © ISO 2021 – All rights reserved

Introduction
Graphs (diagrams consisting of nodes and links) have been used for decades to represent and visualize
both documents (instance data) and data schemas. This document concerns graph-based representation
(not visualization) of documents (not data schemas).
[15]
Graph-based representation and visualization of documents are addressed by concept maps, mind
maps, argument maps, and so on. Theoretical linguistics and artificial intelligence have also used
[10]
graph-based content visualization associated with semantic network, mental space, discourse
[13]
representation structure, and so forth.
Graph-based visualization of data schemas (or ontologies, terminologies, metamodels, etc.) is a more
usual practice. Ontologies are often visualized as graphs in which nodes are classes (and datatypes) and
links are properties (relations). ISO 24156-1 specifies a UML-based visualization of concept modelling.
Other metamodels are usually represented as similar diagrams, too.
This document gives a data schema of graph documents to facilitate composition and comprehension by
making logical document structure explicit. It neither covers visualizations or manipulations of graphs
nor does it define annotations to existing documents, but rather it addresses graphical/diagrammatic
representation of documents for the sake of semantic authoring: i.e., for people to directly view and
manipulate syntactic/semantic structures on computer displays or their future alternatives. The
linearity of traditional text documents is due to the linearity of speech languages, which constrains the
interaction between people and documents, making it hard for people to read and write. DSA defines
graphical/diagrammatic documents with more explicit structures than in text in order to make it easier
for people to read and write. Documents based on DSA, together with some user interfaces involving
appropriate visualizations and easy operations, can enhance collaborations among people and between
people and machines.
DSA mainly deals with syntactic or document structures. It addresses some fragmentary semantic
structures as well, but more systematic semantics (formal mapping between documents and
their meanings or logical forms) can be provided by another specification so that machines better
‘understand’ DSA-based documents and thereby better assist information sharing and consensus
building among people.
Figure 1 shows a workflow involving DSA and other types of documents. The DSA-based documents
in the upper half can be automatically converted (while preserving propositional content) to and from
machine-understandable documents based on appropriate standards on semantic representations
and annotations. It is possible to automatically generate traditional text documents from these
machine-understandable documents (while preserving the propositional content, too), though the
inverse conversion cannot generally be automated. Since DSA-based documents (together with some
appropriate user interfaces) are easier for people to compose and interpret than text documents,
people can usually touch and see DSA-based documents whereas traditional documents could be used
for legacy procedures (such as patent applications) and oral presentations.
Figure 1 — Document workflow involving DSA
DSA is a minimal metamodel for ISO TS 24617-5 (SemAF-DS), which in turn is based on ISO/IEC 15938.5/
Amd.1 (MPEG-7 MDS AMD1 ― Linguistic description scheme). The machine-understandable documents
in Figure 1 are assumed to use other standards including ISO 24615 (SynAF), ISO 24612 (LAF) and
[1][8][9][10][11][12][13]
ISO 24617 (SemAF) while also incorporating insights from other relevant literature
[14][15][16][17][18]
.
vi © ISO 2021 – All rights reserved

INTERNATIONAL STANDARD ISO 24627-3:2021(E)
Language resource management — Comprehensive
Annotation Framework (ComAF) —
Part 3:
Diagrammatic semantic authoring (DSA)
1 Scope
This document specifies how to represent (not visualize) documents (instance data, not data schemas)
as graphs. It does not specify how to visualize or operate on document data, but it aims at making
documents easier for people to compose and comprehend by allowing for various graph-based flexible
user interfaces, possibly incorporating document-visualization practices (see Introduction). In this
connection, this document does not specify annotations to existing documents either, but rather it
specifies a schema of documents with explicit logical structures.
2 Normative references
There are no normative references in this document.
3 Terms and definitions
For the purposes of this document, the following terms and definitions apply.
ISO and IEC maintain terminological databases for use in standardization at the following addresses:
— ISO Online browsing platform: available at https:// www .iso .org/ obp
— IEC Electropedia: available at http:// www .electropedia .org/
3.1
hypernode
node which is a graph segment
3.2
segment
referenceable part of a DSA-based document, which is either a graph segment or a data segment (text,
image, audio, video, etc.)
3.3
semantic authoring
composition of documents while making their logical structures explicit
4 Specification
Figure 2 illustrates DSA, which is a metamodel (ontology) of graph documents. Each markable
(referenceable) part of a DSA-based document is called a segment. Each DSA-based document itself is a
segment, too.
Figure 2 — DSA metamodel (ontology)
A graph segment shall be a labelled directed graph comprising nodes and links. A node shall be a
segment and a link shall be a relation between two endnodes (source and destination segments).
A node in a graph segment can be another graph segment. A node which is a graph segment is called a
hypernode. So a graph segment can embed other graph segments as hypernodes, which are rarely used
in concept maps. Arbitrarily large DSA-based documents can hence be made of small graph segments
which are hypernodes of each other.
A data segment can embed smaller data segments as constituents. For instance, a discourse can be a
text/audio/video segment embedding sentence utterances as smaller text/audio/video segments,
which can further embed still smaller text/audio/video segments such as phrases, and so forth.
DSA is exactly the metamodel in Figure 2, which formally specifies multimodal graph documents
involving hypernodes, but does not specify how to visualize or operate on them. The diagrammatic
visualizations in Annex A are not part of DSA, they are informative examples. DSA users can adopt any
sorts of diagrammatic visualization as far as the logical document structures are made explicit enough
to meet their purposes.
2 © ISO 2021 – All rights reserved

Annex A
(informative)
Examples
A.1 Graph segments
In Figure A.1, graph segments are visualized as rounded boxes. The smaller graph segment is a
hypernode in the larger one. Most nodes in Figure A.1 are text segments, but nodes can be other types of
segments as well. Links in this example represent discourse relations and thematic roles. For instance,
a ‘core’ and an ‘axis’ are constituents of a ‘pencil’ and a function of an ‘eraser’ is to ‘erase mainly pencil
writings.’
Figure A.1 — Graph segment representing part of a patent document
Throughout this document, anaphors are underlined and their antecedents are in boldface characters.
Anaphoric relations can be explicitly visualized as links as shown in Figure A.2. Some endnodes of these
anaphoric links are text segments embedded in larger graph/text segments and thus those endnodes do
not belong to the graphs to which the anaphoric relations belong. A problem is that a graph visualizing
too many such anaphoric relations is hard for people to understand and compose. This problem is
partially addressed by visualizing graphs as trees.
SIST IS
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.