2014 22nd Telecommunications Forum, TELFOR 2014 - Proceedings of Papers, 2014, Article number 7034561, Pages 943-946
22nd Telecommunications Forum, TELFOR 2014; The Sava Center, Belgrade; Serbia; 25 November 2014 through 27 November 2014; Category number CFP1498P-CDR; Code 116803

Inverted index search in data mining (Conference Paper)

  • (a) Faculty of Technical Science Kosovska Mitrovica, University of Pristina, Kneza Miloša 7, Pristina, 38220, Serbia
  • (b) Faculty of Informatics and Computing, Singidunum University, Danijelova 32, Belgrade, 11000, Serbia

Abstract

Data mining has its origins in various disciplines, the two most important of which are statistics and machine learning. Data mining is the process of finding new, useful knowledge in data using different techniques. These techniques provide faster and better search over large amounts of data. An inverted index is a structure that can be used in the data mining process. It is a sorted list of words, with the list of corresponding documents attached to each word. The authors explored the inverted index structure for a large corpus of documents. For that purpose, they created an application that uses the inverted index structure. The application uses an open source library named Lucene. © 2014 IEEE.
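
The structure described in the abstract (a sorted list of words, each with its list of documents) can be illustrated with a minimal sketch. This is not the authors' application and does not use Lucene; it is a plain-Python illustration, and the function names (`tokenize`, `build_inverted_index`, `search`), the simple lowercase tokenizer, and the three-document corpus are all assumptions made for the example.

```python
from collections import defaultdict
import re


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_inverted_index(documents):
    """Map each word to the sorted list of IDs of documents that contain it."""
    postings = defaultdict(set)
    for doc_id, text in documents.items():
        for word in tokenize(text):
            postings[word].add(doc_id)
    # Sort the words and each posting list, following the
    # "sorted list of words" description in the abstract.
    return {word: sorted(postings[word]) for word in sorted(postings)}


def search(index, query):
    """Return the IDs of documents that contain every word in the query."""
    words = tokenize(query)
    if not words:
        return []
    result = set(index.get(words[0], []))
    for word in words[1:]:
        result &= set(index.get(word, []))
    return sorted(result)


# Hypothetical three-document corpus, for illustration only.
documents = {
    1: "Data mining finds useful knowledge in data",
    2: "An inverted index speeds up text search",
    3: "Lucene builds an inverted index for text search and data mining",
}
index = build_inverted_index(documents)
```

With this toy corpus, `search(index, "inverted index")` returns `[2, 3]`. A query only touches the posting lists of its own words, so lookup cost depends on the length of those lists and not on the size of the whole corpus. A production library such as Lucene adds analysis, compression, and ranking on top of this basic idea.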

Author keywords

Application; Data Mining; Inverted Index; Lucene; Text Search

Indexed keywords

Engineering controlled terms: Applications; Indexing (of information); Learning systems
Engineering uncontrolled terms: Inverted index structures; Inverted indices; Large amounts of data; Lucene; Open-source libraries; Text search
Engineering main heading: Data mining
  • ISBN: 978-147996190-0
  • Source Type: Conference Proceeding
  • Original language: English
  • DOI: 10.1109/TELFOR.2014.7034561
  • Document Type: Conference Paper
  • Sponsors: ERICSSON - Belgrade, et al., International Business Machines d.o.o. (IBM) - Belgrade, IRITEL a.d. BEOGRAD, Ministry of Education, Science and Technological Development, University of Belgrade, ETF - School of Electrical Engineering
  • Publisher: Institute of Electrical and Electronics Engineers Inc.

  Ilic, M.; Faculty of Technical Science Kosovska Mitrovica, University of Pristina, Kneza Miloša 7, Pristina, Serbia;

Cited by 13 documents

He, R., Qu, Y.
Partitioned Inverted Index Compression Using Hierarchical Dirichlet Process
(2024) 2024 4th International Conference on Neural Networks, Information and Communication Engineering, NNICE 2024

Maden, E., Karagoz, P.
Recent methods on short text stream clustering: A survey study
(2023) Wiley Interdisciplinary Reviews: Computational Statistics

Kim, B., Jang, H.-J.
Genetic-Based Keyword Matching DBSCAN in IoT for Discovering Adjacent Clusters
(2023) CMES - Computer Modeling in Engineering and Sciences
