A Scalable Data Access Layer to Manage Structured Heterogeneous Biomedical Data. [PDF]
This work presents a scalable data access layer, called PyEHR, designed to support the implementation of data management systems for secondary use of structured heterogeneous biomedical and clinical data.
Giovanni Delussu+3 more
doaj +1 more source
Paper on Searching and Indexing Using Elasticsearch
In today’s era, it is inconceivable to use traditional techniques / RDBMS to analyse the data as it is growing very quickly. Big data offers the solution for analysing large amount of data. Using technique of Elasticsearch, access to data can be made faster. Elasticsearch is a search engine based on Lucene.
Devarshi Mehta, Darshita Kalyani
openaire +2 more sources
Study of Competition in the Textile Sector by Twitter Social Network Analysis
Social networks have become a place where brands or companies are advertised to increase their market share. The companies allocate resources to hire community managers that disseminate quality content and perform customer service tasks.
Ana Belén Gil González+3 more
doaj +3 more sources
Roaring Bitmaps: Implementation of an Optimized Software Library [PDF]
Compressed bitmap indexes are used in systems such as Git or Oracle to accelerate queries. They represent sets and often support operations such as unions, intersections, differences, and symmetric differences. Several important systems such as Elasticsearch, Apache Spark, Netflix's Atlas, LinkedIn's Pinot, Metamarkets' Druid, Pilosa, Apache Hive ...
arxiv +1 more source
Hdconfigor: Automatically Tuning High Dimensional Configuration Parameters for Log Search Engines
Search engines are nowadays widely applied to store and analyze logs generated by large-scale distributed systems. To adapt to various workload scenarios, log search engines such as Elasticsearch usually expose a large number of performance-related ...
Hui Dou, Pengfei Chen, Zibin Zheng
doaj +1 more source
Sherlock in OSS: A Novel Approach of Content-Based Searching in Object Storage System [PDF]
Object Storage Systems (OSS) inside a cloud promise scalability, durability, availability, and concurrency. However, open-source OSS does not have a specific approach to letting users and administrators search based on the data, which is contained inside the object storage, without involving the entire cloud infrastructure. Therefore, in this paper, we
arxiv
Big Data Processing for Full-Text Search and Visualization with Elasticsearch [PDF]
In this paper, the task of using Big Data to identify specific individuals on the indirect grounds of their interaction with information resources is considered. Possible sources of Big Data and problems related to its processing are analyzed. Existing means of data clustering are considered.
Shamil Magomedov+3 more
openaire +1 more source
A Big Data Architecture for Log Data Storage and Analysis [PDF]
We propose an architecture for analysing database connection logs across different instances of databases within an intranet comprising over 10,000 users and associated devices. Our system uses Flume agents to send notifications to a Hadoop Distributed File System for long-term storage and ElasticSearch and Kibana for short-term visualisation ...
arxiv +1 more source
A Fast Content-Based Image Retrieval Method Using Deep Visual Features [PDF]
Fast and scalable Content-Based Image Retrieval using visual features is required for document analysis, Medical image analysis, etc. in the present age. Convolutional Neural Network (CNN) activations as features achieved their outstanding performance in this area.
arxiv +1 more source
Optimizing Elasticsearch Search Experience Using a Thesaurus
The Belgian Art Links and Tools (BALaT) is the continuously expanding online documentary platform of the Royal Institute for Cultural Heritage (KIK-IRPA), Brussels (Belgium).
Emmanuel Di Pretoro+5 more
doaj