We present the design and methodology for the large scale hybrid paper
recommender system used by Microsoft Academic. The system provides
recommendations for approximately 160 million English research papers and
patents. Our approach handles incomplete citation information while also
alleviating the cold-start problem that often affects other recommender
systems. We use the Microsoft Academic Graph (MAG), titles, and available
abstracts of research papers to build a recommendation list for all documents,
thereby combining co-citation and content based approaches. Tuning system
parameters also allows for blending and prioritization of each approach which,
in turn, allows us to balance paper novelty versus authority in recommendation
results. We evaluate the generated recommendations via a user study of 40
participants, with over 2400 recommendation pairs graded and discuss the
quality of the results using P@10 and nDCG scores. We see that there is a
strong correlation between participant scores and the similarity rankings
produced...

more |
pdf
| html
arxiv_org:
A Scalable Hybrid Research Paper Recommender System for Microsoft Academic. https://t.co/MCWGCsl9TU https://t.co/JmQBrprc4N

arxivml:
"A Scalable Hybrid Research Paper Recommender System for Microsoft Academic",
Anshul Kanakia, Zhihong Shen, Darrin …
https://t.co/ucxCGxiGj7

The raw data and analysis code for the Microsoft Academic paper recommender system user study conducted in 2018.

Stargazers: 1

Subscribers: 1

Subscribers: 1

Forks: 0

Open Issues: 0

Open Issues: 0

None.

Sample Sizes : None.

Authors: 4

Total Words: 5543

Unqiue Words: 1931

We propose the use of beamplots - which can be produced by using the R
package BibPlots and WoS downloads - as a preferred alternative to h index
values for assessing single researchers.

more |
pdf
| html
RHaunschild:
Searching for an alternative to the h index? Display the (age-weighted) citation distribution in a beamplot: https://t.co/Vw1l6XlxBG
The R package BibPlots (https://t.co/zW3FRFntK2) provides an easy to use function for this. #bibliometrics #R #CRAN https://t.co/lvq6bku3hc

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 1058

Unqiue Words: 485

Nowadays, Machine Learning (ML) is seen as the universal solution to improve
the effectiveness of information retrieval (IR) methods. However, while
mathematics is a precise and accurate science, it is usually expressed by less
accurate and imprecise descriptions, contributing to the relative dearth of
machine learning applications for IR in this domain. Generally, mathematical
documents communicate their knowledge with an ambiguous, context-dependent, and
non-formal language. Given recent advances in ML, it seems canonical to apply
ML techniques to represent and retrieve mathematics semantically. In this work,
we apply popular text embedding techniques to the arXiv collection of STEM
documents and explore how these are unable to properly understand mathematics
from that corpus. In addition, we also investigate the missing aspects that
would allow mathematics to be learned by computers.

more |
pdf
| html
None.

BrundageBot:
Why Machines Cannot Learn Mathematics, Yet. André Greiner-Petter, Terry Ruas, Moritz Schubotz, Akiko Aizawa, William Grosky, and Bela Gipp https://t.co/pP9d6LGg48

SciFi:
Why Machines Cannot Learn Mathematics, Yet. https://t.co/PA0icrGv7k

hiropon_matsu:
"Why Machines Cannot Learn Mathematics, Yet." https://t.co/j0JZ5Jrmmf

tak_yamm:
RT @hiropon_matsu: "Why Machines Cannot Learn Mathematics, Yet." https://t.co/j0JZ5Jrmmf

None.

None.

Sample Sizes : None.

Authors: 6

Total Words: 6519

Unqiue Words: 2219

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

*Tracking 131,277 papers.*

Sort results based on if they are interesting or reproducible.

Interesting

Reproducible