Marko Grobelnik

OntoGen: Semi-automatic Ontology Editor (2007)

Fortuna, Blaz, Grobelnik, Marko, Mladenić, Dunja

In this paper we present a semi-automatic ontology editor as implemented in a new version of OntoGen system. The system integrates machine learning and text mining algorithms into an efficient user...

User profiling for the web (2007)

Grcar, Miha, Mladenić, Dunja, Grobelnik, Marko

This paper addresses a problem of personalized information delivery related to the Web, that is based on user profiling. Different approaches to user profiling have been developed. When the user...

User study of ontology generation tool (2007)

Ilijasic Misic, Ivana, Kovacic, Bozidar, Mohoric, Tamara, Mladenić, Dunja, Fortuna, Blaz, Grobelnik, Marko

We present design and results of a user study undertaken in order to evaluate ontology generation process. We have applied our study to an example tool for semi-automatic ontology generation –...

Evaluation of semi-automatic ontology generation in real-world setting (2007)

Mladenić, Dunja, Grobelnik, Marko

This paper presents several aspects of evaluating semi-automatic ontology generation techniques in real-world setting. We provide description of incorporating the techniques in a solution to...

Machine learning for resolving researcher affiliation (2007)

Sterk, Marjan, Vladusic, Damijel, Milosevic, Eva, Ferlez, Jure, Mladenić, Dunja, Grobelnik, Marko

This paper describes the Institution Finder, an approach to develop a simple web mining procedure to find the internet domain of the institution(s) that a given researcher is affiliated with. The...

Triplet extraction from sentences (2007)

Rusu, Delia, Dali, Lorand, Fortuna, Blaz, Grobelnik, Marko, Mladenić, Dunja

In this paper we present an approach to extracting subject-predicate-object triplets from English sentences. To begin with, four different well known syntactical parsers for English are used for...

PREDICTING THE ADDITION OF NEW CONCEPTS IN A TOPIC HIERARCHY (2007)

Brank, Janez, Mladenić, Dunja, Grobelnik, Marko

Ontologies often change through time, a process largely done manually by human editors. We discuss the task of automatically predicting when structural changes will occur in a given ontology. We...

Extracting named entities and relating them over time based on Wikipedia (2007)

Bhole, Abhiji, Fortuna, Blaz, Grobelnik, Marko, Mladenić, Dunja

This paper presents an approach to mining information relating people, places, organizations and events extracted from Wikipedia and linking them on a time scale. The approach consists of two phases:...

From social network to light-weight ontology (2007)

Mladenić, Dunja, Grobelnik, Marko, Fortuna, Blaz

We address the problem of constructing a light-weight ontology from social network data. As an example we use social network of a mid size research institution obtained based on e-mail communication....

Using text mining and link analysis for software (2007)

Grcar, Miha, Grobelnik, Marko, Mladenić, Dunja

Many data mining techniques are these days in use for ontology learning – text mining, Web mining, graph mining, link analysis, relational data mining, and so on. In the current state-of-the-art...

Pascal Workshop, Complex Objects Visualization 2005 - COV2005, Proceedings (2006)

Pisanski, Tomaž, Horvat, Boris, Žerovnik, Janez, Mladenić, Dunja, Grobelnik, Marko, Anžič, Tina, ...

Pascal workshop on Complex Object Visualization COV-2005 brought together a group of researchers from various branches of Mathematics and Computer Science focused around a common theme that arises in...

Hierarchical text categorization using coding matrices (2006)

Brank, Janez, Mladenić, Dunja, Grobelnik, Marko

We discuss the task of ontology population as a machine learning problem with a large hierarchy of classes. Since many machine learning methods are designed primarily for two-class problems, it is...

Semi-automatic data-driven ontology construction system (2006)

Fortuna, Blaz, Grobelnik, Marko, Mladenić, Dunja

In this paper we present a new version of OntoGen system for semi-automatic data-driven ontology construction. The system is based on a novel ontology learning framework which formalizes and extends...

Extending IST World database with Serbian research publications, (2006)

Radovanovic, Milos, Ferlez, Jure, Mladenić, Dunja, Grobelnik, Marko, Ivanovic, Mirjana

This paper describes an effort of using knowledge technologies to gain insights into research activity, by exploiting publicly available information on research publications. The specificity of this...

System for Semi-automatic Ontology construction (2006)

Fortuna, Blaz, Grobelnik, Marko, Mladenić, Dunja

In this paper, we review two techniques for topic discovery in collections of text documents (Latent Semantic Indexing and KMeans clustering) and present how we integrated them into a system for...

Background Knowledge for Ontology Construction (2006)

Fortuna, Blaz, Grobelnik, Marko, Mladenić, Dunja

In this paper we describe a solution for incorporating background knowledge into the OntoGen system for semi-automatic ontology construction. This makes it easier for different users to construct...

Using DMoz for constructing ontology from data stream (2006)

Grobelnik, Marko, Brank, Janez, Mladenić, Dunja, Novak, Blaz, Fortuna, Blaz

This paper presents an approach for constructing an ontology from a stream of documents. Named entities extracted from the documents are used as instances of the ontology. Entities and co-occurring...

Knowledge discovery for ontology construction (2006)

Grobelnik, Marko, Mladenić, Dunja

We can observe that the focus of modern information systems is moving from ‘data-processing’ towards ‘concept-processing’, meaning that the basic unit of processing is less and less is the...

Visualization of text document corpus (2005)

Fortuna, Blaz, Mladenić, Dunja, Grobelnik, Marko

From the automated text processing point of view, natural language is very redundant in the sense that many different words share a common or similar meaning. For computer this can be hard to...

A survey of ontology evaluation techniques (2005)

Brank, Janez, Grobelnik, Marko, Mladenić, Dunja

An ontology is an explicit formal conceptualization of some domain of interest. Ontologies are increasingly used in various fields such as knowledge management, information extraction, and the...

Semi-automatic construction of topic ontology (2005)

Fortuna, Blaz, Mladenić, Dunja, Grobelnik, Marko

In this paper, we review two techniques for topic discovery in collections of text documents (Latent Semantic Indexing and K-Means clustering) and present how we integrated them into a system for...

User profiling for interest-focused browsing history (2005)

Grcar, Miha, Mladenić, Dunja, Grobelnik, Marko

User profiling is an important part of the Semantic Web as it integrates the user into the concept of Web data with machine-readable semantics. In this paper, user profiling is presented as a way of...

Using machine learning to structure the expertise of companies: analysis of the Yahoo! business data (2005)

Plisson, Joel, Mladenić, Dunja, Ljubic, Peter, Lavrac, Nada, Grobelnik, Marko

Organizations have to collaborate in order to achieve business goals which require to use a variety of domainspecific knowledge. Selection of partners with an appropriate expertise is one of the...

to preserve Slovenian digital heritage (2005)

Mladenić, Dunja, Grobelnik, Marko, Kavcic-Colic, Alenka

This paper describes an initiative for preserving Slovenian digital heritage via setting Slovenian national digital archive. We have proposed methodology for archiving electronic publications based...

Initiative to preserve Slovenian digital heritage (2005)

Mladenić, Dunja, Grobelnik, Marko, Kavcic-Colic, Alenka

This paper describes an initiative for preserving Slovenian digital heritage via setting Slovenian national digital archive. We have proposed methodology for archiving electronic publications based...

Automated Knowledge Discovery in Advanced Knowledge Management (2005)

Grobelnik, Marko, Mladenić, Dunja

Knowledge Management is a discipline with many faces – among very provocative ones is the research area dealing with automatic discovery of the hidden truth within the data describing the world...

Simple classification into large topic ontology of Web documents (2005)

Grobelnik, Marko, Mladenić, Dunja

The paper presents an approach to classifying Web documents into large topic ontology. The main emphasis is on having a simple approach appropriate for handling a large ontology and providing it with...

Simple classification into large topic ontology of Web documents (2005)

Grobelnik, Marko, Mladenić, Dunja

The paper presents an approach to classifying Web documents into large topic ontology. The main emphasis is on having a simple approach appropriate for handling a large ontology and providing it with...

Impact of Linguistic Analysis on the Semantic Graph Coverage and Learning of Document Extracts (2005)

Leskovec, Jure, Milic-Frayling, Natasa, Grobelnik, Marko

Automatic document summarization is a problem of creating a document surrogate that adequately represents the full document content. We aim at a summarization system that can replicate the quality of...

Visualizing very large graphs using clustering neighborhoods (2005)

Mladenić, Dunja, Grobelnik, Marko

This paper presents a method for visualization of large graphs in a two-dimensional space, such as a collection of Web pages. The main contribution here is in the representation change to enable...

Summarization and visualization (2005)

Mladenić, Dunja, Grobelnik, Marko

Both text summarization and visualization aim at providing some sort of general view of the text either giving a text summary in the required natural language or giving some visual representation of...

Applying collaborative filtering to real0life corporate data (2005)

Grcar, Miha, Mladenić, Dunja, Grobelnik, Marko

In this paper, we present our experience in applying collaborative filtering to real-life corporate data. The quality of collaborative filtering recommendations is highly dependent on the quality of...

Text classification with active learning (2005)

Novak, Blaz, Mladenić, Dunja, Grobelnik, Marko

In many real world machine learning tasks, labeled training examples are expensive to obtain, while at the same time there is a lot of unlabeled examples available. One such class of learning...

Mapping Documents onto Web Page Ontology (2004)

Mladenić, Dunja, Grobelnik, Marko

The paper describes an approach to automatically mapping Web pages onto ontology using document classification based on the Yahoo! ontology of Web pages. Techniques developed for learning on text...

VISUALIZATION OF NEWS ARTICLES (2004)

Grobelnik, Marko, Mladenić, Dr. Dunja

This paper presents a system for visualization of large amounts of new stories. In the first phase, the new stories are preprocessed for the purpose of name -entity extraction. Next, a graph of...

Learning Sub-structures of Document Semantic Graphs for Document Summarization (2004)

Leskovec, Jure, Grobelnik, Marko, Milic-Frayling, Natasa

In this paper we present a method for summarizing document by creating a semantic graph of the original document and identifying the substructure of such a graph that can be used to extract sentences...

SEKT: Semantically Enabled Knowledge Technologies (2004)

Thomas Gabel, York Sure, Johanna Voelker, Acknowledgements To Dunja Mladenic, Marko Grobelnik

Deliverable D12.1.1. (WP12.1) This document accompanies the SEKT web site and mailing lists. Important features of the web site, which consists of a public and a private part, as well as open issues...

Text Mining (2004)

Grobelnik, Marko, Mladenić, Dunja

This tutorial gives an overview of the Text Mining problem. After introducing the challenges faced, the various levels of text processing are discussed.

Interaction of Feature Selection Methods and Linear Classification Models (2002)

Janez Brank, Marko Grobelnik

In this paper we explore effects of various feature selection algorithms on document classification performance. We propose to use two, possibly distinct linear classifiers: one used exclusively for...

Text Mining as Integration of Several Related Research Areas: Report on KDD'2000 Workshop on Text Mining (2000)

Marko Grobelnik, Dunja Mladenic, Natasa Milic-frayling

In this paper we give an overview of the KDD'2000 Workshop on Text Mining that was held in Boston, MA on August 20, 2000. We report in detail on the research issues covered in the papers presented at...

Predicting Content from Hyperlinks (2000)

Marko Grobelnik

This paper describes an approach to prediction of a document content based on the hyperlink that points to the document. The k-Nearest Neighbor algorithm is used to predict a set of words that appear...

Predicting Content From Hyperlinks (2000)

Marko Grobelnik

This paper describes an approach to prediction of a document content based on the hyperlink that points to the document. The k-Nearest Neighbor algorithm is used to predict a set of words that appear...

Efficient Text Categorization (2000)

Marko Grobelnik

We present an approach to text categorization using machine learning techniques. The approach is developed and tested on large text hierarchy named Yahoo that is available on the Web. We handle the...

Assigning Keywords to Documents Using Machine Learning (1999)

Marko Grobelnik

This paper describes the usage of machine learning techniques to assign keywords to documents. The large hierarchy of documents available on the Web, Yahoo hierarchy, is used here as a real-world...

Assigning Keywords to Documents Using Machine Learning (1999)

Marko Grobelnik

This paper describes the usage of machine learning techniques to assign keywords to documents. The large hierarchy of documents available on the Web, Yahoo hierarchy, is used here as a real-world...

Assigning Keywords to Documents Using Machine Learning (1999)

Marko Grobelnik

This paper describes the usage of machine learning techniques to assign keywords to documents. The large hierarchy of documents available on the Web, Yahoo hierarchy, is used here as a real-world...

Feature selection for unbalanced class distribution and Naive Bayes (1999)

Marko Grobelnik

This paper describes an approach to feature subset selection that takes into account problem specifics and learning algorithm characteristics. It is developed for the Naive Bayesian classifier...

Feature selection for unbalanced class distribution and Naive Bayes (1999)

Marko Grobelnik

This paper describes an approach to feature subset selection that takes into account problem specifics and learning algorithm characteristics. It is developed for the Naive Bayesian classifier...

Feature selection for unbalanced class distribution and Naive Bayes (1999)

Marko Grobelnik

This paper describes an approach to feature subset selection that takes into account problem specifics and learning algorithm characteristics. It is developed for the Naive Bayesian classifier...

Assigning Keywords to Documents Using Machine Learning (1999)

Marko Grobelnik

This paper describes the usage of machine learning techniques to assign keywords to documents. The large hierarchy of documents available on the Web, Yahoo hierarchy, is used here as a real-world...

Word Sequences as Features in Text-Learning (1999)

Marko Grobelnik

This paper proposes an efficient algorithm for the generation of new features that enrich the known bagof -words document representation. New features are generated based on word sequences of...

Feature selection for unbalanced class distribution and Naive Bayes (1999)

Marko Grobelnik

This paper describes an approach to feature subset selection that takes into account problem specifics and learning algorithm characteristics. It is developed for the Naive Bayesian classifier...

Experiments In Learning Nonrecursive Definitions Of Relations With Linus (1999)

Nada Lavrac, Saso Dzeroski, Marko Grobelnik

Many successful inductive learning systems use a propositional attribute-value language for the representation of training examples and induced concept descriptions. Recent developments are concerned...

Word Sequences as Features in Text-Learning (1998)

Marko Grobelnik

This paper proposes an efficient algorithm for the generation of new features that enrich the known bagof -words document representation. New features are generated based on word sequences of...

Word Sequences as Features in Text-Learning (1998)

Marko Grobelnik

This paper proposes an efficient algorithm for the generation of new features that enrich the known bagof -words document representation. New features are generated based on word sequences of...

Word Sequences as Features in Text-Learning (1998)

Marko Grobelnik

This paper proposes an efficient algorithm for the generation of new features that enrich the known bagof -words document representation. New features are generated based on word sequences of...

Word Sequences as Features in Text-Learning (1998)

Marko Grobelnik

This paper proposes an efficient algorithm for the generation of new features that enrich the known bagof -words document representation. New features are generated based on word sequences of...

Feature Selection for Classification Based on Text Hierarchy (1998)

Marko Grobelnik

This paper describes automatic document categorization based on large text hierarchy. We handle the large number of features and training examples by taking into account hierarchical structure of...

Feature Selection for Classification Based on Text Hierarchy (1998)

Marko Grobelnik

This paper describes automatic document categorization based on large text hierarchy. We handle the large number of features and training examples by taking into account hierarchical structure of...

Feature Selection for Classification Based on Text Hierarchy (1998)

Marko Grobelnik

This paper describes automatic document categorization based on large text hierarchy. We handle the large number of features and training examples by taking into account hierarchical structure of...

Feature Selection for Classification Based on Text Hierarchy (1998)

Marko Grobelnik

This paper describes automatic document categorization based on large text hierarchy. We handle the large number of features and training examples by taking into account hierarchical structure of...

Feature Selection for Classification Based on Text Hierarchy (1998)

Marko Grobelnik

This paper describes automatic document categorization based on large text hierarchy. We handle the large number of features and training examples by taking into account hierarchical structure of...

Efficient Text Categorization (1998)

Marko Grobelnik

We present an approach to text categorization using machine learning techniques. The approach is developed and tested on large text hierarchy named Yahoo that is available on the Web. We handle the...

Efficient Text Categorization (1998)

Marko Grobelnik

We present an approach to text categorization using machine learning techniques. The approach is developed and tested on large text hierarchy named Yahoo that is available on the Web. We handle the...

Efficient Text Categorization (1998)

Marko Grobelnik

We present an approach to text categorization using machine learning techniques. The approach is developed and tested on large text hierarchy named Yahoo that is available on the Web. We handle the...

Efficient Text Categorization (1998)

Marko Grobelnik

We present an approach to text categorization using machine learning techniques. The approach is developed and tested on large text hierarchy named Yahoo that is available on the Web. We handle the...

Strategy Extraction When Playing Games (1995)

Marko Grobelnik, Vesna Prasnikar

Modelling of a subject's strategy is rather important task for many situations e.g. in economy. However, modelling is mainly done by hand, i.e. repeating the cycle of proposing a hypothesis model and...

Classification Tree Chaining in Data Analysis (1995)

Marko Grobelnik, Ray J. Paul, Ivan Bratko

Understanding of discrete event simulation systems is a difficult task. Our approach involves consultation with a domain expert, and the use of discrete event simulation model and machine learning as...

Classroom Games: Strategic Interaction on the Internet

Marko Grobelnik, Charles A. Holt, Vesna Prasnikar

Economics is often taught at a level of abstraction that can hinder some students from gaining basic intuition. However, lecture and textbook presentations can be complemented with classroom...