Programming

Learning Phrases with PyLucene and Pytorch, part 2.

In part 2 we reuse our tokenised index and use pytorch to build a model for significant phrase extraction. It worked surprisingly well and being able to switch Analyzers proved useful. We found that the English Analyzer with stopword removal and stemming worked best.

The results are indicative, neither the dataset size or the length of training cycles are sufficient for the development of a genralised phrase extractor but the succcess and ovelap found between pylucene and pytorch is very encouraging. We just need to scale it up.

Continue reading

Areas of Expertise

At ACUMED Consulting, we specialise in using data science to drive progress, innovation and success within businesses. We have concentrated experience in Gaming Analytics, but have worked in cybersecurity, fraud detection, marketing, banking, insurance, telecommunications and government sectors too. Our experts are skilled in a range of data science and analytics areas including:

Analytical Methods and Problem Solving

Reducing dimensions Time series analysis Anomaly detection - finding unusual behaviour

Machine Learning

Continue reading