Top 10 arXiv Papers Today in Computers and Society


2.004 Mikeys
#1. Estimating Glycemic Impact of Cooking Recipes via Online Crowdsourcing and Machine Learning
Helena Lee, Palakorn Achananuparp, Yue Liu, Ee-Peng Lim, Lav R. Varshney
Consumption of diets with low glycemic impact is highly recommended for diabetics and pre-diabetics as it helps maintain their blood glucose levels. However, laboratory analysis of dietary glycemic potency is time-consuming and expensive. In this paper, we explore a data-driven approach utilizing online crowdsourcing and machine learning to estimate the glycemic impact of cooking recipes. We show that a commonly used healthiness metric may not always be effective in determining recipes suitable for diabetics, thus emphasizing the importance of the glycemic-impact estimation task. Our best classification model, trained on nutritional and crowdsourced data obtained from Amazon Mechanical Turk (AMT), can accurately identify recipes which are unhealthful for diabetics.
Figures
None.
Tweets
arxiv_cs_LG: Estimating Glycemic Impact of Cooking Recipes via Online Crowdsourcing and Machine Learning. Helena Lee, Palakorn Achananuparp, Yue Liu, Ee-Peng Lim, and Lav R. Varshney https://t.co/IoTu20MLNS
arxiv_cscl: Estimating Glycemic Impact of Cooking Recipes via Online Crowdsourcing and Machine Learning https://t.co/HtKTUeXSRn
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unique Words: 0

2.004 Mikeys
#2. Discovering Differential Features: Adversarial Learning for Information Credibility Evaluation
Lianwei Wu, Yuan Rao, Ambreen Nazir, Haolin Jin
A series of deep learning approaches extract a large number of credibility features to detect fake news on the Internet. However, the extracted features still include many irrelevant and noisy ones that severely restrict the performance of these approaches. In this paper, we propose a novel model based on Adversarial Networks and inspired by the Shared-Private model (ANSP), which aims at reducing common, irrelevant features from the extracted features for information credibility evaluation. Specifically, ANSP involves two tasks: one prevents the binary classification of true and false information in order to capture common features, relying on adversarial networks guided by reinforcement learning. The other extracts credibility features (henceforth, private features) from multiple types of credibility information and compares them with the common features through two strategies, i.e., orthogonality constraints and KL-divergence, to make the private features more differential. Experiments first on two six-label LIAR and Weibo...
Figures
None.
Tweets
BrundageBot: Discovering Differential Features: Adversarial Learning for Information Credibility Evaluation. Lianwei Wu, Yuan Rao, Ambreen Nazir, and Haolin Jin https://t.co/iogcg93xZG
arxiv_cs_LG: Discovering Differential Features: Adversarial Learning for Information Credibility Evaluation. Lianwei Wu, Yuan Rao, Ambreen Nazir, and Haolin Jin https://t.co/FeaxRS53Xb
arxiv_cscl: Discovering Differential Features: Adversarial Learning for Information Credibility Evaluation https://t.co/L9YPOhPD3b
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 12968
Unique Words: 3536

2.003 Mikeys
#3. Machine learning in healthcare -- a system's perspective
Awais Ashfaq, Slawomir Nowaczyk
A consequence of the fragmented and siloed healthcare landscape is that patient care (and data) is split across a multitude of different facilities and computer systems, and enabling interoperability between these systems is hard. The lack of interoperability not only hinders continuity of care and burdens providers, but also hinders effective application of Machine Learning (ML) algorithms. Thus, most current ML algorithms, designed to understand patient care and facilitate clinical decision-support, are trained on limited datasets. This approach is analogous to the Newtonian paradigm of Reductionism, in which a system is broken down into elementary components and a description of the whole is formed by understanding those components individually. A key limitation of the reductionist approach is that it ignores the component-component interactions and dynamics within the system, which are often of prime significance in understanding the overall behaviour of complex adaptive systems (CAS). Healthcare is a CAS. Though the application of...
Figures
None.
Tweets
arxiv_cs_LG: Machine learning in healthcare -- a system's perspective. Awais Ashfaq and Slawomir Nowaczyk https://t.co/YVE6RLkVs8
SantchiWeb: RT @arxiv_cs_LG: Machine learning in healthcare -- a system's perspective. Awais Ashfaq and Slawomir Nowaczyk https://t.co/YVE6RLkVs8
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 2814
Unique Words: 1355

1.998 Mikeys
#4. Student Performance Prediction with Optimum Multilabel Ensemble Model
Ephrem Admasu Yekun, Abrahaley Teklay
One of the important measures of the quality of education is the performance of students in academic settings. Nowadays, educational institutions store abundant data about students, which can help to discover insights into how students are learning and how to improve their performance ahead of time using data mining techniques. In this paper, we developed a student performance prediction model that predicts the performance of high school students for the next semester for five courses. We modeled our prediction system as a multi-label classification task and used support vector machine (SVM), Random Forest (RF), K-nearest Neighbors (KNN), and Multi-layer perceptron (MLP) as base classifiers to train our model. We further improved the performance of the prediction model using state-of-the-art partitioning schemes to divide the label space into smaller spaces and used the Label Powerset (LP) transformation method to transform each labelset into a multi-class classification task. The proposed model achieved better performance in terms...
Figures
Tweets
ephraimAdmassu: Thank you @arxiv_org ! https://t.co/pDmhCmw9S3
Fetu_Ethio: RT @ephraimAdmassu: Thank you @arxiv_org ! https://t.co/pDmhCmw9S3
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 5902
Unique Words: 2014

1.997 Mikeys
#5. A Case Study of Spreadsheet Use within the Finance and Academic Registry units within a Higher Education Institution
Simon Thorne, Jamie Hancock
This paper presents the findings of a case study of spreadsheet use in a higher education institution in the UK. The paper considers the use of spreadsheets in two units of the organisation, academic registry and finance. Spreadsheet use is explored in terms of importance, training, experience, purpose, techniques deployed, size of spreadsheets created and sharing of spreadsheets. The implications of the results are then considered in terms of accurate reporting to external funding bodies such as the funding councils, internal data integrity and internal data efficiencies. The results show that a large volume of spreadsheets is being created and used, that the profile of spreadsheet developers is typical of other studies of spreadsheet use, and that the organisation needs clear principles and guidelines for the development of spreadsheet models to ensure data integrity, reduce duplication of effort and optimise the use of spreadsheets to meet the institution's goals.
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 7346
Unique Words: 2024

1.997 Mikeys
#6. On the design of an innovative solution for increasing hazardous materials transportation safety
Emil Pricop
Transportation of hazardous materials represents a high-risk operation all over the world. Flammable substances such as oil, kerosene, hydrocarbons, ammonium nitrate or toxic products are shipped every day on busy roads by trucks. An innovative solution for increasing hazardous materials transportation safety is presented in this paper. The solution integrates three systems: one mounted on the truck that can alert authorities in case of an accident, one portable system for quick identification of the carried substances and intervention method, and a component for real-time road monitoring. The proposed solution is based on an RFID card with a special memory structure presented in this paper.
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 0
Unique Words: 0

1.997 Mikeys
#7. A prototype for a serious digital game to teach linguistic ontologies
Diana Medina, Grissa Maturana, Fernán Villa
The objective of ontologies is to increase the comprehension of a given domain by eliminating interpretation problems. Among the kinds of ontologies are linguistic ontologies, which are used to simplify the interface between domain knowledge and linguistic components. Digital games have received increasing interest from educators in recent years for their potential to enhance the language learning and linguistic learning experience. Within the literature are games to teach ontologies of a specific domain, and games that use ontologies to facilitate the understanding of a given domain. Other educational games teach linguistics or vocabulary in contexts in which language is useful and meaningful. Although games help to understand difficult topics, the use of games that seek to meet the learning objectives of linguistics is not very popular, and those focused on teaching linguistic ontologies are scarce. To address the lack of recreational resources for teaching linguistics, in this document a prototype of a digital game called...
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

1.997 Mikeys
#8. Modeling Traffic Congestion with Spatiotemporal Big Data for An Intelligent Freeway Monitoring System
Karisma Trinanda Putra, Jing-Doo Wang, Eko Prasetyo, Prayitno
Traffic congestion is a complex, nonlinear spatiotemporal modeling problem. By collecting and analyzing a vast quantity and different categories of information, traffic flow and road congestion can be predicted and controlled on an intelligent transportation system. This report provides an analysis of traveling time across Taiwan from north to south and vice versa. We analyze traffic on the national freeway section between Tainan and Kaohsiung, which represents the common trip of the population in Southern Taiwan. The data is recorded using the Electronic Toll Collection System (ETC) provided by the Ministry of Transportation in Taiwan. We use the MapReduce framework to split the data processing into smaller tasks that can be distributed across several computer clusters to speed up the process. The results show that the spatiotemporal model of traffic flow is strongly influenced by direction, working hours, and holidays, with a recurring pattern for each week. The distinctive pattern inside the spatiotemporal dataset can be used on an AI-powered...
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unique Words: 0

1.997 Mikeys
#9. A Forensic Qualitative Analysis of Contributions to Wikipedia from Anonymity Seeking Users
Kaylea Champion, Nora McDonald, Stephanie Bankes, Joseph Zhang, Rachel Greenstadt, Andrea Forte, Benjamin Mako Hill
By choice or by necessity, some contributors to commons-based peer production sites use privacy-protecting services to remain anonymous. As anonymity seekers, users of the Tor network have been cast both as ill-intentioned vandals and as vulnerable populations concerned with their privacy. In this study, we use a dataset drawn from a corpus of Tor edits to Wikipedia to uncover the character of Tor users' contributions. We build in-depth narrative descriptions of Tor users' actions and conduct a thematic analysis that places their editing activity into seven broad groups. We find that although their use of a privacy-protecting service marks them as unusual within Wikipedia, the character of many Tor users' contributions is in line with the expectations and norms of Wikipedia. However, our themes point to several important places where lack of trust promotes disorder, and to contributions where risks to contributors, service providers, and communities are unaligned.
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 7
Total Words: 15533
Unique Words: 4124

1.994 Mikeys
#10. Don't cross that stop line: Characterizing Traffic Violations in Metropolitan Cities
Shashank Srikanth, Aanshul Sadaria, Himanshu Bhatia, Kanay Gupta, Pratik Jain, Ponnurangam Kumaraguru
In modern metropolitan cities, the task of ensuring safe roads is of paramount importance. Automated systems of e-challans (Electronic traffic-violation receipt) are now being deployed across cities to record traffic violations and to issue fines. In the present study, an automated e-challan system established in Ahmedabad (Gujarat, India) has been analyzed for characterizing user behaviour, violation types as well as finding spatial and temporal patterns in the data. We describe a method of collecting e-challan data from the e-challan portal of Ahmedabad traffic police and create a dataset of over 3 million e-challans. The dataset was first analyzed to characterize user behaviour with respect to repeat offenses and fine payment. We demonstrate that a lot of users repeat their offenses (traffic violation) frequently and are less likely to pay fines of higher value. Next, we analyze the data from a spatial and temporal perspective and identify certain spatio-temporal patterns present in our dataset. We find that there is a drastic...
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 6897
Unique Words: 1923

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 192,914 papers.
