Arab Spring Twitter Dataset

http://doi.org/10.5281/zenodo.1185553 Arab-Spring Movement was a wave of anti-government demonstrations and protests which took place in a substantial number of middle-east ...
Read More

LMPP: LArge margin point process

https://doi.org/10.5281/zenodo.1163560 N/A LMPP: LArge margin point process IJCAI @article{samantalmpp, title={LMPP: A Large Margin Point Process Combining Reinforcement and Competition for ...
Read More

Smartphone Sensor Data of Indian Cities for Public Bus

http://dx.doi.org/10.17632/92yrxtv5gn.1 The data contains smartphone sensor data for public buses in three Indian cities, Kolkata, Bhubaneswar and Durgapur. The sensors ...
Read More

Improving Document Ranking for Long Queries with Nested Query Segmentation

https://zenodo.org/record/1137746#.WlOJybyWa00 "Improving Document Ranking for Long Queries with Nested Query Segmentation" 1. Query test of SGCL12 [SGCL12QueryTestSet.txt] 2. Outputs of ...
Read More

Code Borrowing Data Set

https://zenodo.org/record/835272#.WXl94ycvC1I The list of files we shared are - 1. EMNLPDataSet.csv 2. EMNLPUserDataSet.csv 3. GroundTruthLPF.csv 4. GroundTruthLPF_Old.csv 5. GroundTruthLPF_Young.csv 6 ...
Read More

Information extraction from microblogs posted during disasters

http://www.isical.ac.in/~fire/data/2016/FIRE2016-microblogs-track-data.tar.gz The dataset contains (i) tweet-ids of about 50,000 tweets posted during the 2015 Nepal earthquake, (ii) a set of ...
Read More

Building a Word Segmenter for Sanskrit Overnight

https://github.com/cvikasreddy/skt The software is a segmenter for sentences in Sanskrit. The tool performs Sandhi splitting of sentences such that the ...
Read More

A Dataset for Sanskrit Word Segmentation

https://zenodo.org/record/803508#.WTuKbSa9UUs The dataset contains about 115,000 sentences in Sanskrit. THe dataset can be used for word segmetnation task. For each ...
Read More