New Models for Collaborative Textual Scholarship
Co-authored with Mark Hedges (lead author), Stuart Dunn, Charlotte Roueche, Marc W. Küster, Thomas Selig, Michael Bittorf, Waldemar Artes.
Accepted for 6th IEEE Int. Conf. on Digital Ecosystems and Technologies for Complex Systems, Environment, and Service Engineering IEEE-DEST 2012.
Final version will be uploaded after editing is finished.
Researchers in digital humanities have for many years been producing online editions of texts based on TEI XML, a... more Researchers in digital humanities have for many years been producing online editions of texts based on TEI XML, a widely-adopted standard for marking up textual resources with semantic content. However, this has led to a certain isolation of information, the so-called ‘digital silo', and such modes of digital publication have not always made best use of the possibilities of digital technologies. The model is also challenged by the need to model texts that are by their very nature interconnected. The paper describes a collaborative environment of tools and techniques for working with texts that allows scholars to work with such highly- interconnected material.
Standard biological parts knowledgebase
Michal Galdzicki, Cesar Rodriguez, Deepak Chandran, Herbert M. Sauro, John H. Gennari
We have created the Knowledgebase of Standard Biological Parts (SBPkb) as a publically accessible Semantic Web... more We have created the Knowledgebase of Standard Biological Parts (SBPkb) as a publically accessible Semantic Web resource for synthetic biology (sbolstandard.org). The SBPkb allows researchers to query and retrieve standard biological parts for research and use in synthetic biology. Its initial version includes all of the information about parts stored in the Registry of Standard Biological Parts (partsregistry.org). SBPkb transforms this information so that it is computable, using our semantic framework for synthetic biology parts. This framework, known as SBOL-semantic, was built as part of the Synthetic Biology Open Language (SBOL), a project of the Synthetic Biology Data Exchange Group. SBOL-semantic represents commonly used synthetic biology entities, and its purpose is to improve the distribution and exchange of descriptions of biological parts. In this paper, we describe the data, our methods for transformation to SBPkb, and finally, we demonstrate the value of our knowledgebase with a set of sample queries. We use RDF technology and SPARQL queries to retrieve candidate “promoter” parts that are known to be both negatively and positively regulated. This method provides new web based data access to perform searches for parts that are not currently possible.
SemKey: A Semantic Collaborative Tagging System
Tagging and Metadata for Social Information Organization Workshop at the 16th World Wide Web Conference (WWW), 2007
By analysing the current structure and the usage patterns of collaborative tagging systems, we can find out many... more By analysing the current structure and the usage patterns of collaborative tagging systems, we can find out many important aspects which still need to be improved. Problems related to synonymy, polysemy, different lexical forms, mispelling errors or alternate spellings, different levels of precision and different kinds of tag-to-resource association cause inconsistencies and reduce the efficiency of content search and the effectiveness of the tag space structuring and organization. They are mainly caused by the lack of semantic information inclusion in the tagging process. We propose a new way to describe resources: the semantic tagging. It allows user to state semantic assertions: each of them expresses a defined characteristic of a resource associating it with a concept. We present SemKey, a semantic collaborative tagging system, describing its global architecture and functioning along with the most relevant organizational issues faced. We explore the adequacy of the support offered by the entries of Wikipedia and WordNet in order to access to and reference concepts.
5 views
Seen by:Tagpedia: a Semantic Reference to Describe and Search for Web Resources
Social Web and Knowledge Management Workshop at the 17th World Wide Web Conference (WWW), 2008
Nowadays the Web represents a growing collection of an enormous amount of contents where the need for better ways to... more
Nowadays the Web represents a growing collection of an enormous amount of contents where the need for better ways to find and organize the available data is becoming a fundamental issue, in order to deal with information overload. Keyword based Web searches are actually the preferred mean to seek for contents related to a specific topic. Search engines and collaborative tagging systems make possible the search for information thanks to the association of descriptive keywords to Web resources. All of them show problems of inconsistency and consequent reduction of recall and precision of searches, due to polysemy, synonymy and in general all the different lexical forms that can be used to refer to a particular meaning. A possible way to face or at least reduce these problems is represented by the introduction of semantics to characterize the contents of Web resources: each resource is described by one or more concepts instead of simple and often ambiguous keywords. To support these task the availability of a global semantic resource of reference is fundamental. On the basis of our past experience with the semantic tagging of Web resources and the SemKey Project, we are developing Tagpedia, a general-domain ”encyclopedia” of tags, semantically structured for generating semantic descriptions of contents over the Web, created by mining Wikipedia. In this paper, starting from an analysis of the weak points of non-semantic keyword based Web searches, we introduce our idea of semantic characterization of Web resources describing the structure and organization of Tagpedia. We introduce our first realization of agpedia, suggesting all the possible improvements that can be carried
out in order to exploit its full potential.
3 views
Seen by:Semantify del.icio.us: automatically turn your tags into senses
Social Data on The Web Workshop at the 7th International Semantic Web Conference (ISWC), 2008
At present tagging is experimenting a great di usion as the most adopted way to collaboratively classify resources... more At present tagging is experimenting a great diusion as the most adopted way to collaboratively classify resources over the Web. In this paper, after a detailed analysis of the attempts made to improve the organization and structure of tagging systems as well as the usefulness of this kind of social data, we propose and evaluate the Tag Disambiguation Algorithm, mining del.icio.us data. It allows to easily semantify the tags of the users of a tagging service: it automatically nds out for each tag the related concept of Wikipedia in order to describe Web resources through senses. On the basis of a set of evaluation tests, we analyze all the advantages of our sense-based way of tagging, proposing new methods to keep the set of users tags more consistent or to classify the tagged resources on the basis of Wikipedia categories, YAGO classes or Wordnet synsets. We discuss also how our semanitied social tagging data are strongly linked to DBPedia and the datasets of the Linked Data community.
2 views
Seen by:Semantic Web in Comparison to Web2.0
TAGS: Semantic Web, Ontology, RDF, Twine, Metadata, Semantic Publishing
The Semantic Web is not a separate Web but an extension of the current one, in which information is given well-defined... more The Semantic Web is not a separate Web but an extension of the current one, in which information is given well-defined meaning, better enabling computers and people to work in cooperation. It is also referred to as Web 3.0 and is an upcoming technology. The Semantic Web is a web of data. The Semantic Web is about two things: it is about common formats for integration and combination of data drawn from diverse sources and about how the data relates to real world objects. Semantic Web can be considered a part of artificial intelligence as it puts into use Model theory, Web effect. Oracle, IBM, Adobe, Software AG, Mozilla and Yahoo! are some of the large corporations that have picked up this technology. This paper presents a comparison of Semantic Web and Web2.0 as well also discusses on the features and capabilities of Semantic Web.
Content, Format, and Interpretation
Dubin, D., Wickett, K. M., & Sacchi, S. (2011). Content, Format, and Interpretation. In B. T. Usdin (Ed.), Proceedings of Balisage: the Markup Conference 2011, Balisage Series on Markup Technologies (Vol. 7). Montreal, Canada.
doi:10.4242/BalisageVol7.Dubin01
Semantic technology and the question-centric curriculum
by Joshua Fost
In this article I describe software that facilitates “question-centric curricula” in which big questions, rather than... more In this article I describe software that facilitates “question-centric curricula” in which big questions, rather than academic disciplines, are the primary means of organizing educational resources. To find these questions, the software scans course catalogs and extracts all sentences ending in a question mark. To find connections between questions and courses, I present several computational techniques. One leverages the Library of Congress system; another implements so-called semantic technology that uses huge numbers of simple internet searches to ascertain the meaning of texts. The software assembles the results and shows, in one image, how every course at an institution relates to a given question.
Linked Data Tools: Semantic Web for the Masses
by Lisa Goddard
Goddard, Lisa and Byrne, Gillian (2010) Linked Data Tools: Semantic Web for the Masses. First Monday, 15 (11). ISSN 1396-0466
Semantic Web technologies have immense potential to transform the Internet into a distributed reasoning machine that... more Semantic Web technologies have immense potential to transform the Internet into a distributed reasoning machine that will not only execute extremely precise searches, but will also have the ability to analyze the data it finds to create new knowledge. This paper examines the state of Semantic Web (also known as Linked Data) tools and infrastructure to determine whether semantic technologies are sufficiently mature for non–expert use, and to identify some of the obstacles to global Linked Data implementation.
The Strongest Link: Libraries and Linked Data
by Lisa Goddard
Goddard, Lisa and Byrne, Gillian (2010) The Strongest Link: Libraries and Linked Data. D-Lib Magazine, 16 (11/12). ISSN 1082-9873
Since 1999 the W3C has been working on a set of Semantic Web standards that have the potential to revolutionize web... more Since 1999 the W3C has been working on a set of Semantic Web standards that have the potential to revolutionize web search. Also known as Linked Data, the Machine-Readable Web, the Web of Data, or Web 3.0, the Semantic Web relies on highly structured metadata that allow computers to understand the relationships between objects. Semantic web standards are complex, and difficult to conceptualize, but they offer solutions to many of the issues that plague libraries, including precise web search, authority control, classification, data portability, and disambiguation. This article will outline some of the benefits that linked data could have for libraries, will discuss some of the non-technical obstacles that we face in moving forward, and will finally offer suggestions for practical ways in which libraries can participate in the development of the semantic web
A data integration concept for an interdisciplinary research database
WILLMES, C. and BARETH, G. (2012): A data integration concept for an interdisciplinary research database. In: Proceedings of the Young Researchers forum on Geographic Information Science - GI Zeitgeist, ifgiPrints 44, Münster, Germany, March 2012, ISBN: 978-3-89838-663-0, Akademische Verlagsgesellschaft AKA, Heidelberg, pp. 67 - 72.
This paper presents an overview of the current state of development of an archaeological and a palaeoenvironmental... more This paper presents an overview of the current state of development of an archaeological and a palaeoenvironmental data model for an interdisciplinary research database. The models are constructed iteratively by integrating heterogeneous data and adjusting the model where necessary. The integration concept is an iterative approach which combines several techniques for data model development, including semantic and syntactic integration and alignment, as well as semantic data linkage with external knowledgebases and models. The goal is to provide integrated spatio-temporal access to an existing wealth of data to facilitate research on the integrated data basis.
40 views
Seen by:Technically approaching the semantic web bottleneck
Co-authored with Dimitrios-Emmanuel Spanos, Periklis Stavrou and Nikolas Mitrou, in International Journal of Web Engineering and Technology (IJWET), 6(1): 83-111, 2010
After several years of research, the fundamental semantic web technologies have reached a high maturity level.... more After several years of research, the fundamental semantic web technologies have reached a high maturity level. Nevertheless, the average web user has not yet taken advantage of their full potential. In this paper, we introduce the semantic web bottleneck, analyse the main problems that preserve it and suggest ways to overcome it. In particular, we discuss the issues involved in deploying, maintaining and using semantically rich web applications, decomposing this process into two primal ones: publishing and exploiting semantic content. We analyse the role of key players such as the web industry, the search engines, the academia, the web user, and the web engineers that essentially materialise and use these technologies. A roadmap is provided in order for the semantic web to gain further acceptance, based on three major axes: simplicity, mainly entailed by automation, integration with the existing technologies and practices, and adoption by the web industry driving forces.
An approach to Intelligent Information Fusion in Sensor Saturated Urban Environments
Co-authored with Ch. Doulaverakis, Th. Knape, I. Kompatsiaris and J. Soldatos, In European Intelligence and Security Informatics Conference (EISIC 2011), co-sponsored by IEEE, Athens, Greece, September 2011
This paper introduces a novel sensor information fusion system enabling security and surveillance in large scale... more
This paper introduces a novel sensor information fusion system enabling security and surveillance in large scale sensor saturated urban environments. The system is built over state-of-the art sensor networks middleware and provides information fusion at multiple layers. A distinguishing characteristic of the system is that it support seamless integration with semantic web middleware (including ontologies and inference mechanisms), which enable intelligent high-level accurate reasoning. This is a key functionality for efficient surveillance in large scale
environment, where manual inspection of individual tracking systems becomes extremely resourceful and overall impractical. A proof-of-concept implementation of the system manifests its benefits and technical challenges, while also outlining lessons learnt.
Semantic Search and Browsing nei Beni Culturali
in "Bollettino del CILEA", 118, dicembre 2011, pp. 26-30
27 views
Seen by:Exploring Manuscripts: Sharing Ancient Wisdoms across the Semantic Web
Co-authored with Mark Hedges, K. Faith Lawrence and Charlotte Tupman. For the ACM International Conference on Web Intelligence, Mining and Semantics (WIMS) June 13-15, 2012 Craiova, Romania
Recent work in digital humanities has seen researchers increasingly producing online editions of texts and... more
Recent work in digital humanities has seen researchers increasingly producing online editions of texts and manuscripts,
particularly in adoption of the TEI XML format for online
publishing. The benefits of semantic web techniques are underexplored in such research, however, with a lack of sharing
and communication of research information. The Sharing
Ancient Wisdoms (SAWS) project applies linked data practices to enhance and expand on what is possible with these
digital text editions. Focussing on Greek and Arabic collections of ancient wise sayings, which are often related to
each other, we use RDF to annotate and extract semantic information from the TEI documents as RDF triples.
This allows researchers to explore the conceptual networks
that arise from these interconnected sayings. The SAWS
project advocates a semantic-web-based methodology, enhancing rather than replacing current workflow processes,
for digital humanities researchers to share their findings and
collectively benefit from each other's work.
22 views
Seen by:
