Top 10 Arxiv Papers Today in Digital Libraries


0.0 Mikeys
#1. Heuristics as conceptual lens for understanding and studying the usage of bibliometrics in research evaluation
Lutz Bornmann, Julian N. Marewski
While bibliometrics is widely used for research evaluation purposes, a common theoretical framework for conceptually understanding, empirically studying, and effectively teaching their usage is lacking. In this paper, we develop one such framework: the fast-and-frugal heuristics research program, proposed originally in the context of the cognitive and decision sciences, lends itself particularly well for understanding and investigating the usage of bibliometrics in research evaluations. Such evaluations represent judgments under uncertainty in which typically neither all possible outcomes, nor their consequences, and probabilities are known, knowable, or can be reliably estimated. In such situations of fuzzy and incomplete information, good descriptive and prescriptive models of human behavior are heuristics. Heuristics are simple strategies that, by exploiting the structure of environments, can aid people to make smart decisions. Relying on heuristics does not mean trading off accuracy against effort: while reducing...
Tweets
lutzbornmann: The proposal of heuristics as a common theoretical framework for conceptually understanding, empirically studying, and effectively teaching the usage of #bibliometrics https://t.co/cjKZkmYjTP
AndreaPolonioli: Very interesting preprint by @lutzbornmann and Julian N. Marewski on fast-and-frugal heuristics and bibliometrics: https://t.co/lagtxcjBE2
ComputerPapers: Heuristics as conceptual lens for understanding and studying the usage of bibliometrics in research evaluation. https://t.co/TKvpGZzTKY
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 2
Total Words: 15927
Unique Words: 4208

0.0 Mikeys
#2. Building and Querying Semantic Layers for Web Archives (Extended Version)
Pavlos Fafalios, Helge Holzmann, Vaibhav Kasturia, Wolfgang Nejdl
Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles ("layers") that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts and events), and publishing all this data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities, and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different...
Tweets
ComputerPapers: Building and Querying Semantic Layers for Web Archives (Extended Version). https://t.co/ur3sfB8blC
Github

Convert web archives to RDF triples with ArchiveSpark

Repository: ArchiveSpark2Triples
User: helgeho
Language: Jupyter Notebook
Stargazers: 0
Subscribers: 1
Forks: 1
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 4
Total Words: 12967
Unique Words: 3595

0.0 Mikeys
#3. Finding Person Relations in Image Data of the Internet Archive
Eric Müller-Budack, Kader Pustu-Iren, Sebastian Diering, Ralph Ewerth
The multimedia content in the World Wide Web is rapidly growing and contains valuable information for many applications in different domains. For this reason, the Internet Archive initiative has been gathering billions of time-versioned web pages since the mid-nineties. However, the huge amount of data is rarely labeled with appropriate metadata and automatic approaches are required to enable semantic search. Normally, the textual content of the Internet Archive is used to extract entities and their possible relations across domains such as politics and entertainment, whereas image and video content is usually neglected. In this paper, we introduce a system for person recognition in image content of web news stored in the Internet Archive. Thus, the system complements entity recognition in text and allows researchers and analysts to track media coverage and relations of persons more precisely. Based on a deep learning face recognition approach, we suggest a system that automatically detects persons of interest and gathers sample...
Tweets
arxiv_org: Finding Person Relations in Image Data of the Internet Archive. https://t.co/ONgrQMJZur https://t.co/ExunmUGM8K
hauschke: RT @arxiv_org: Finding Person Relations in Image Data of the Internet Archive. https://t.co/ONgrQMJZur https://t.co/ExunmUGM8K
pathakraul: RT @arxiv_org: Finding Person Relations in Image Data of the Internet Archive. https://t.co/ONgrQMJZur https://t.co/ExunmUGM8K
Github
Repository: PIIA
User: TIB-Visual-Analytics
Language: JavaScript
Stargazers: 1
Subscribers: 2
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 4
Total Words: 4933
Unique Words: 1634

0.0 Mikeys
#4. CSIndexbr: Exploring the Brazilian Scientific Production in Computer Science
Marco Tulio Valente, Klérisson Paixão
CSIndexbr is a web-based system that provides meaningful, open, and transparent data about Brazilian scientific production in Computer Science. Currently, the system collects full research papers published in the main track of selected conferences. The papers are retrieved from DBLP. In this article, we describe the main features and resources provided by CSIndexbr. We also comment on how other researchers can use the data provided by the system to analyze the Brazilian production in Computer Science.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 2
Total Words: 923
Unique Words: 436

0.0 Mikeys
#5. Simulation Study on a New Peer Review Approach
Albert Steppi, Jinchan Qu, Minjing Tao, Tingting Zhao, Xiaodong Pang, Jinfeng Zhang
The increasing volume of scientific publications and grant proposals has generated an unprecedentedly high workload to scientific communities. Consequently, review quality has been decreasing and review outcomes have become less correlated with the real merits of the papers and proposals. A novel distributed peer review (DPR) approach has recently been proposed to address these issues. The new approach assigns principal investigators (PIs) who submitted proposals (or papers) to the same program as reviewers. Each PI reviews and ranks a small number (such as seven) of other PIs' proposals. The individual rankings are then used to estimate a global ranking of all proposals using the Modified Borda Count (MBC). In this study, we perform simulation studies to investigate several parameters important for the decision making when adopting this new approach. We also propose a new method called Concordance Index-based Global Ranking (CIGR) to estimate global ranking from individual rankings. An efficient simulated annealing algorithm is...
Tweets
M157q_News_RSS: Simulation Study on a New Peer Review Approach. (arXiv:1806.08663v2 [cs.DL] UPDATED) https://t.co/cSHYuxrKTt The increasing volume of scientific publications and grant proposals has generated an unprecedentedly high workload to scientific communities. Consequently, review quality
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 6
Total Words: 6838
Unique Words: 1885
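
The abstract of paper #5 above describes turning many small individual rankings into one global ranking. As a rough illustration only, the sketch below uses a plain Borda-style aggregation over partial rankings. It is not the paper's Modified Borda Count or its CIGR method; the function name, the point scheme (points scaled by list length and averaged per proposal), and the sample data are all assumptions made for this example.

```python
from collections import defaultdict


def borda_global_ranking(partial_rankings):
    """Aggregate partial rankings (best first) into one global ranking.

    Assumed scheme, not the paper's MBC: a proposal at 0-based position p
    in a list of length m earns (m - p) / m points. A proposal's score is
    the mean of the points it earned, so proposals that were reviewed a
    different number of times stay comparable.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for ranking in partial_rankings:
        m = len(ranking)
        for position, proposal in enumerate(ranking):
            totals[proposal] += (m - position) / m
            counts[proposal] += 1
    mean_score = {p: totals[p] / counts[p] for p in totals}
    # Highest mean score first; ties are broken by name for determinism.
    return sorted(mean_score, key=lambda p: (-mean_score[p], p))


# Hypothetical data: four reviewers, each ranking three of four proposals.
rankings = [
    ["A", "B", "C"],
    ["B", "C", "D"],
    ["A", "C", "D"],
    ["A", "B", "D"],
]
print(borda_global_ranking(rankings))
```

Averaging rather than summing is a design choice for this sketch: it avoids rewarding a proposal simply for being assigned to more reviewers.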

0.0 Mikeys
#6. Data Mining in Scientometrics: usage analysis for academic publications
Olesya Mryglod, Yurij Holovatch, Ralph Kenna
We perform a statistical analysis of scientific-publication data with a goal to provide quantitative analysis of scientific process. Such an investigation belongs to the newly established field of scientometrics: a branch of the general science of science that covers all quantitative methods to analyze science and research process. As a case study we consider download and citation statistics of the journal `Europhysics Letters' (EPL), as Europe's flagship letters journal of broad interest to the physics community. While citations are usually considered as an indicator of academic impact, downloads reflect rather the level of attractiveness or popularity of a publication. We discuss peculiarities of both processes and correlations between them.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 3
Total Words: 4256
Unique Words: 1586

0.0 Mikeys
#7. The determinants of academic career advancement: evidence from Italy
Giovanni Abramo, Ciriaco Andrea D'Angelo, Francesco Rosati
In this work we investigate the determinants of professors' career advancement in Italian universities. From the analyses, it emerges that the fundamental determinant of an academic candidate's success is not scientific merit, but rather the number of years that the candidate has belonged to the same university as the selection committee president. Where applicants have participated in research work with the president, their probability of success also increases significantly. The factors of the years of service and occurrence of joint research for the other commission members also have an effect, however of lesser weight. The specific phenomenon of nepotism, although it exists, seems less important. The scientific quality of the commission members has negligible effect on the expected outcome of the competition, and even more so the geographic location of the university calling for the competition.
Figures
None.
Tweets
lutzbornmann: Nepotism in the Italian science system https://t.co/v4KtPjsOVl
angelosalatino: Evidence shows that academic career in Italy is determined more by who you know rather than your scientific merit. Analysis by Giovanni Abramo, Ciriaco Andrea D'Angelo, and @RosatiFran Paper: https://t.co/IYjCCFTRVC #ScienceOfScience #bibliometrics #nepotism https://t.co/JFoPERu7Sa
SauvonsMaScienc: @BrKloeckner @JulienGossa And on this topic, could you say whether the conclusions of this Italian study would apply to the French case (note: in math there is no localism): The determinants of academic career advancement: evidence from Italy https://t.co/ByvWAYFazN
AlexUsherHESA: RT @lutzbornmann: Nepotism in the Italian science system https://t.co/v4KtPjsOVl
isidroaguillo: RT @lutzbornmann: Nepotism in the Italian science system https://t.co/v4KtPjsOVl
christianmunthe: RT @lutzbornmann: Nepotism in the Italian science system https://t.co/v4KtPjsOVl
schneiderleonid: RT @lutzbornmann: Nepotism in the Italian science system https://t.co/v4KtPjsOVl
innostudy: RT @lutzbornmann: Nepotism in the Italian science system https://t.co/v4KtPjsOVl
nrobinsongarcia: RT @lutzbornmann: Nepotism in the Italian science system https://t.co/v4KtPjsOVl
AndreaPolonioli: RT @lutzbornmann: Nepotism in the Italian science system https://t.co/v4KtPjsOVl
daforerog: RT @lutzbornmann: Nepotism in the Italian science system https://t.co/v4KtPjsOVl
biblio_metrics: RT @lutzbornmann: Nepotism in the Italian science system https://t.co/v4KtPjsOVl
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 3
Total Words: 10411
Unique Words: 3109

0.0 Mikeys
#8. Reviewing, indicating, and counting books for modern research evaluation systems
Alesia Zuccala, Nicolas Robinson-Garcia
In this chapter, we focus on the specialists who have helped to improve the conditions for book assessments in research evaluation exercises, with empirically based data and insights supporting their greater integration. Our review highlights the research carried out by four types of expert communities, referred to as the monitors, the subject classifiers, the indexers and the indicator constructionists. Many challenges lie ahead for scholars affiliated with these communities, particularly the latter three. By acknowledging their unique, yet interrelated roles, we show where the greatest potential is for both quantitative and qualitative indicator advancements in book-inclusive evaluation systems.
Figures
None.
Tweets
nrobinsongarcia: Now in Open Access, our forthcoming chapter "Reviewing, indicating and counting books for modern research evaluation systems" w @AlesiaZuccala https://t.co/rDZje1tO9s
ComputerPapers: Reviewing, indicating, and counting books for modern research evaluation systems. https://t.co/cLTl7i9N2Q
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 2
Total Words: 10245
Unique Words: 3302

0.0 Mikeys
#9. Measuring institutional research productivity for the life sciences: the importance of accounting for the order of authors in the byline
Giovanni Abramo, Ciriaco Andrea D'Angelo, Francesco Rosati
Accurate measurement of institutional research productivity should account for the real contribution of the research staff to the output produced in collaboration with other organizations. In the framework of bibliometric measurement, this implies accounting for both the number of co-authors and each individual's real contribution to scientific publications. Common practice in the life sciences is to indicate such contribution through the order of author names in the byline. In this work, we measure the distortion introduced to university-level bibliometric productivity rankings when the number of co-authors or their position in the byline is ignored. The field of observation consists of all Italian universities active in the life sciences (Biology and Medicine). The analysis is based on the research output of the university staff over the period 2004-2008. Based on the results, we recommend against the use of bibliometric indicators that ignore co-authorship and real contribution of each author to research outputs.
Figures
None.
Tweets
ComputerPapers: Measuring institutional research productivity for the life sciences: the importance of accounting for the order of authors in the byline. https://t.co/PuBwx0KMR7
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 3
Total Words: 6913
Unique Words: 1927
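
The abstract of paper #9 above argues that institutional rankings shift when the number of co-authors or byline position is ignored. The sketch below shows how such a shift can arise by comparing an equal split of credit with a position-weighted split. The weighting scheme, the `lead_share` value of 0.4, the function names, and the sample bylines are hypothetical choices for illustration; the abstract does not state the weights the authors use.

```python
def fractional_credit(n_authors):
    """Equal split: each of n co-authors receives 1/n of the paper."""
    return [1.0 / n_authors] * n_authors


def position_credit(n_authors, lead_share=0.4):
    """Hypothetical byline weighting (lead_share is an assumed value).

    First and last authors each receive lead_share; the remaining credit
    is split equally among the middle authors. Papers with one or two
    authors fall back to an equal split.
    """
    if n_authors <= 2:
        return fractional_credit(n_authors)
    middle = (1.0 - 2 * lead_share) / (n_authors - 2)
    return [lead_share] + [middle] * (n_authors - 2) + [lead_share]


def institution_totals(papers, credit_fn):
    """Sum per-author credit by institution.

    Each paper is a byline-ordered list of institution names, one entry
    per author.
    """
    totals = {}
    for byline in papers:
        for inst, credit in zip(byline, credit_fn(len(byline))):
            totals[inst] = totals.get(inst, 0.0) + credit
    return totals


# Hypothetical bylines for two institutions.
papers = [
    ["Uni A", "Uni B", "Uni B", "Uni A"],  # Uni A holds first and last position
    ["Uni B", "Uni A", "Uni B"],           # Uni B holds first and last position
]
print(institution_totals(papers, fractional_credit))
print(institution_totals(papers, position_credit))
```

With these made-up bylines, the equal split favors Uni B, while the position-weighted split puts the two institutions level, which is the kind of distortion the abstract refers to.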

0.0 Mikeys
#10. Metadata Enrichment of Multi-Disciplinary Digital Library: A Semantic-based Approach
Hussein T. Al-Natsheh, Lucie Martinet, Fabrice Muhlenbach, Fabien Rico, Djamel A. Zighed
In the scientific digital libraries, some papers from different research communities can be described by community-dependent keywords even if they share a semantically similar topic. Articles that are not tagged with enough keyword variations are poorly indexed in any information retrieval system which limits potentially fruitful exchanges between scientific disciplines. In this paper, we introduce a novel experimentally designed pipeline for multi-label semantic-based tagging developed for open-access metadata digital libraries. The approach starts by learning from a standard scientific categorization and a sample of topic tagged articles to find semantically relevant articles and enrich its metadata accordingly. Our proposed pipeline aims to enable researchers reaching articles from various disciplines that tend to use different terminologies. It allows retrieving semantically relevant articles given a limited known variation of search terms. In addition to achieving an accuracy that is higher than an expanded query based method...
Tweets
arxiv_org: Metadata Enrichment of Multi-Disciplinary Digital Library: A Semantic-based Approach. https://t.co/cjjGaKi8zv https://t.co/Ekzsbamzys
Github

Scientific Topic Semantics Tagging

Repository: stst
User: ERICUdL
Language: OpenEdge ABL
Stargazers: 0
Subscribers: 0
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 5
Total Words: 5392
Unique Words: 1839

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored in real time on how verifiable they are (as determined by their GitHub repos) and how interesting they are (based on Twitter).

To see top papers, follow us on Twitter: @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 72,893 papers.
