17Counterfeit Pharmaceutical Drug Identification

Sajidha S. A.1*, Aakif Mairaj2, Amit Kumar Tyagi3, A. Vijayalakshmi1, Nisha V. M.1, Siddharth Nair1, C.K.M. Ganesan1, Ram Gunasekaran1 and Hitarth Menon1

1School of Computer Science and Engineering, Vellore Institute of Technology, Chennai, India

2School of Sciences, Indiana University Kokomo, Kokomo, Indiana, USA

3Department of Fashion Technology, National Institute of Fashion Technology, New Delhi, India

Abstract

Counterfeit drug selling is an extremely dangerous problem around the world, which has escalated in the recent past. It has caused millions of deaths and serious complications in the health of people who consume them. It is not possible for the common man to detect subtle differences in the packaging and the back-strip labels of drugs. This paper proposes an NLP-based optical character recognition-based algorithm, coupled with named entity recognition and hashed database searching to identify and mark pharmaceutical drugs as valid or counterfeit. Optical character recognition using a Tesseract engine would identify the contents of the back-strip of drugs. At the second stage, these contents will pass through a custom named entity tagger to identify the drug name. The third stage will involve the extraction of relevant information from the database to perform a series of validation techniques on the extracted drug data. On processing and confirming these parameters, the drugs will be classified as counterfeit or valid. ...

Get Conversational Artificial Intelligence now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.