A Hitchhiker’s Guide to Knowledge Galaxies

Mapping Conceptual Galaxies (Georg von Welling)

Preamble

Ontologies

The aim of ontologies is to make sense of the discourses about the nature of things. To that end they must weave together thesauruses, for the meaning of discourses, and models, for the representation of things:

  • Thesauruses deal with the meaning of terms, categories, and concepts.
  • Models add syntax to build structured representations of contexts and concerns.
  • Ontologies add pragmatics in order to ensure the functional integration of models and thesauruses in physical and symbolic environments. 
Ontology Architecture

Ontologies can be built from scratch or through the conceptual equivalent of APIs (Application Programming Interface).
The scratch option aims to redefine the whole of syntactical and semantic constructs, mixing meanings and representation; it ensues that its “open” capability (or ecumenism) is achieved through translations which are subject to cumulative updates that create exponential complexity.
By contrast the kernel option only adds ontological constructs to unambiguous conceptual primitives, maintaining a distinction between meanings (thesauruses) and representations (modeling languages). That makes for a two-tiered built-in interoperability, with models on the one hand, thesauruses on the other hand.

Basically, ontological entries can be organized in terms of:

  • Concepts: for pure semantic constructs defined independently of instances or categories
  • Categories: for symbolic descriptions of sets of objects or phenomena; Categories can be associated with actual (descriptive, extensional) or intended (prescriptive, intensional) sets of instances
  • Aspects: for symbolic descriptions of features (properties or behaviors)
  • Facts: for observations of objects or phenomena 
  • Documents: for the symbolic contents of media, with instances representing the actual (digital or physical) copies

The challenge is then to ensure the consistency and interoperability of meanings and representations.

Consistency & Interoperability

As far as enterprises are concerned, the architecture of ontologies should match information systems capabilities; to that end the contents of thesauruses and models can be expounded in terms of data (observations from environments), information (data+structures and semantics), and knowledge (information put to use):

Ontologies as Glue between Meanings & Purposes
  • Thesauruses federate meanings across digital (observations) and business (analysis) environments
  • Models deal with the integration of digital flows (environments) and symbolic representations (systems)
  • Ontologies ensure the integration of enterprises’ data, information, and knowledge, enabling a long-term alignment of enterprises’ organization, systems, and business objectives

The aim of this article is to consider how conceptual galaxies can ensure a seamless integration of representations, typically semantic networks and conceptual or knowledge graphs.

Networks, Graphs, & Galaxies

For thesauruses set in ontologies the primary objective is to attach words to environments’ objects and phenomena whatever their nature (concrete or symbolic) and naming source (external or internal). To that effect, names are first attached to observed data taking into account the specificities of contexts (e.g. business domains) and the heterogeneity of sources (e.g., data lakes or factories). Beyond the various terminologies, three basic kinds of networks can be used to organize and refine meanings:

  • Semantic networks represent the relationships between words
  • Conceptual networks/graphs introduce a built-in distinction for categories
  • Knowledge graphs add reasoning capabilities

Cut to the bone, these representations are all made of nodes and connectors, and the objective is to ensure a seamless integration of semantic, conceptual, and knowledge representations, and consequently the interoperability of corresponding applications, typically Natural language processing, Business intelligence, and decision-making.

Semantic Networks & Gravity

Assuming music-like lexical interoperability, thesauruses in ontologies should combine top-down and bottom-up perspectives:

  • Top-down meanings set by convention and specific to domains, defined literally on discrete scales.
  • Bottom-up meanings built from observed communications independently of domains, with meanings weighted relatively on continuous scales.

Semantic networks could thus provide a common launching pad, with connecting weights (SW) defined (solid lines) or computed (dashed lines) depending on perspective:

Defined (solid lines) and observed (dashed lines) semantic connections

That basis would then be extended to support conceptual and knowledge representation.

Conceptual Networks & Planetary Systems

Semantic networks have no built-in structural distinction between nodes that could be mapped to natural (e.g. Role and reference grammar) or modeling languages (e.g. UML) constructs. Nevertheless, taking a cue from astronomy and gravitational forces, nodes could be understood as concepts (or stars), primary terms (or planets), and secondary terms (or satellites), bound together by semantic attraction forces.

Semantic networks could thus be represented as planetary systems organised on three tiers:

  • Positive or negative forces between concepts
  • Orbits of primary terms around concepts
  • Orbits of secondary terms around primary ones
Planetary Patterns

Semantic weights (SW) would vary between -1 (antonym) and +1 (synonym), between concepts, from concepts to primary terms, and from primary to secondary terms; e.g.:

Semantic Galaxy
  • T1 meaning is set by Ca (SW=.8)
  • T2 meaning is set by Ca (SW=.4) and Cb (SW=.7)
  • T3 meaning is set by Ca (SW=.2) and Cc (SW=.-7)

It must be noted that such semantic planetary systems are set at thesaurus level and thus deal with meanings, whether defined top-down from business or institutional domains, or emerging bottom-up from communications independently of specific contexts. It ensues that planetary systems can also be associated with domains, ensuring a seamless integration of defined and observed meanings (thesauruses) with canned ones (models).

Knowledge Graphs & Galaxies

Planetary systems introduce three tiers of nodes and a transition to conceptual networks, providing a bridge between semantic networks (for thesauruses), conceptual graphs (for models), and knowledge graphs (for ontologies). These tiers can also be aligned with enterprise architecture layers:

  • Semantic networks are used to map observations (data) to thesauruses
  • Conceptual networks (semantic networks with planetary systems) can then integrate thesauruses with models (information)
  • Finally, knowledge graphs are used to build ontologies from conceptual networks and galaxies
Weaving the Fabric of Meanings

Ontological languages like OWL can provide for a seamless implementation of networks, graphs, and galaxies:

  • Nodes: individuals (or instances) for observations; categories for stars and planets
  • Connectors: realization (dashed, yellow), generalization/specialization (solid, yellow); semantics (solid, blue)
Graph & Galaxies Representation

Ontologies can then support a dynamic integration of live and managed meanings:

  • Live meanings: set through communications, represented by planetary systems
  • Managed meanings: defined by organizations, represented by domains

In return, that integration opens the door to universal knowledge interfaces.

Sailing Through Knowledge Galaxies

The digital transformation of enterprises combined with the ubiquity of AI and ML technologies raises new questions regarding accesses to knowledge:

  • At the system level: how to balance rule-based (symbolic, explicit knowledge) and deep-learning (non-symbolic, implicit knowledge) technologies
  • At the user level: how to guarantee the traceability and accountability of decision-making processes

Both questions, and more, can be best answered when set on a dual representation/communication perspective subject to the double constraint of explainability and expansibility.

Knowledge Representation: Ontologies & Learning

Rule-based and ML technologies are often misrepresented in terms of traditional vs new AI, missing their technical and functional continuity and complementarity.

With regard to continuity, advances in ML technologies have been driven by the simultaneous availability of massive data and computing power. With regard to complementarity, networks can serve as technical bridges between implicit (ML) and explicit (rule-based) knowledge.

Conceptual galaxies add functional dimensions to that complementarity:

  • Planetary systems with differentiated semantics (e.g. RRG and UML) are used to improve the consistency and interoperability of natural and modeling languages.
  • The dynamic nature of galaxies enables a continuous expanse of knowledge (aka learning), with ML for the mining of semantic weights observed in communication soup, planetary systems to map them to models, and galaxies to integrate the whole into knowledge graphs.

That holistic perspective can be translated into learning capabilities: observation (ML), reasoning (models), and judgment (knowledge graphs).

Knowledge Communication: Interfaces & Languages

As one would expect, the distinction between communication and representation in relation with knowledge is mirrored by the roles of languages, the nexus being the gear between syntax and semantics.

With regard to communication three tiers can be summarily identified:

  • Digital interfaces, for agents devoid of symbolic processing capabilities, commonly known as “things”.
  • Symbolic interfaces, for agents with unambiguous syntactic and semantic capabilities, typically traditional systems without AI abilities.
  • Natural language interfaces, for agents empowered with the whole of syntactic, semantic, and pragmatic capabilities, typically but not exclusively people.

The interoperability between these tiers has been hampered by theoretical as well as practical difficulties stemming mainly from the ways modeling and natural languages can be processed.

Being defined upfront modeling languages can set apart syntax, semantics, and pragmatics, with interfaces organized accordingly. By contrast, being practiced before being defined, natural languages mix formal with pragmatic tiers, inducing semantics-based interfaces. It ensues a structural discrepancy between communication interfaces at system and user levels, hampering the consistency and the interoperability of functions performed across levels.

Such discrepancies can be ironed out using conceptual galaxies to set apart lexical semantics unambiguously tied to syntax and lexicons, from pragmatic ones mixing meanings from lexicons, managed domains, and customary communications. Interfaces based on lexical semantics would ensure the consistency and interoperability of observation and reasoning across levels; they could be extended with pragmatic semantics supporting judgment expressed through natural languages.

FURTHER READING

%d bloggers like this: