Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- invited-talkApril 2023
Graph-Inceptor: Towards Extreme Data Ingestion, Massive Graph Creation and Storage
ICPE '23 Companion: Companion of the 2023 ACM/SPEC International Conference on Performance EngineeringPages 253–254https://doi.org/10.1145/3578245.3585339Graph processing is increasingly popular given the wide range of phenomena represented as graphs (e.g., social media networks, pharmaceutical drug compounds, or fraud networks, among others). The increasing amount of data available requires new ...
- research-articleJune 2019
FishStore: Faster Ingestion with Subset Hashing
SIGMOD '19: Proceedings of the 2019 International Conference on Management of DataPages 1711–1728https://doi.org/10.1145/3299869.3319896The last decade has witnessed a huge increase in data being ingested into the cloud, in forms such as JSON, CSV, and binary formats. Traditionally, data is either ingested into storage in raw form, indexed ad-hoc using range indices, or cooked into ...
- extended-abstractOctober 2016
1st international workshop on multi-sensorial approaches to human-food interaction (workshop summary)
ICMI '16: Proceedings of the 18th ACM International Conference on Multimodal InteractionPages 601–603https://doi.org/10.1145/2993148.3007633This is an introductory paper for the workshop entitled ‘Multi-Sensorial Approaches to Human-Food Interaction’ held at ICMI 2016, which took place the 16th of November, 2016 in Tokyo, Japan. Here we discuss our objectives and the relevance of the ...
- research-articleNovember 2012
Web crawler middleware for search engine digital libraries: a case study for citeseerX
- Jian Wu,
- Pradeep Teregowda,
- Madian Khabsa,
- Stephen Carman,
- Douglas Jordan,
- Jose San Pedro Wandelmer,
- Xin Lu,
- Prasenjit Mitra,
- C. Lee Giles
WIDM '12: Proceedings of the twelfth international workshop on Web information and data managementPages 57–64https://doi.org/10.1145/2389936.2389949Middleware is an important part of many search engine web crawling processes. We developed a middleware, the Crawl Document Importer (CDI), which selectively imports documents and the associated metadata to the digital library CiteSeerX crawl repository ...