5FLASH: Web-Form’s Logical Analysis & Session Handling Automatic Form Classification and Filling on Surface and Dark Web

Ashwini Dalvi1*, Viraj Thakkar2, Smit Moradiya2, Aditya Vedpathak2, Irfan Siddavatam2, Fark Kazi1 and S.G. Bhirud1

1Veermata Jijabai Technological Institute, Mumbai, India

2K. J. Somaiya College of Engineering, Mumbai, India

Abstract

Data collection and mining has quickly established itself as a topic of interest, with a focus on openly available data already used extensively. The recent years have turned attention to the dark web due to its unmonitored nature, especially in regard to data behind sign up or registration pages, access to which is not yet automated. With present work already developed on the structure and semantics of a web form, researchers categorize forms as search forms and non-search forms. This paper furthers this by proposing an automated process that determines the more common types of form (Search, Login, Registration, Other) and further demonstrates an automated method to fill and submit forms on the surface and dark web. To achieve this, we perform the interpretation of the form tags and further maintain a headless session for low system overheads. The approach suggested and demonstrated in this paper has been tested for classification and filling (including login after registration) on over 2000 different forms with 84.8% accuracy for classification and 61.1% accuracy for filling.

Keywords: Security automation, robotic process application, ...

Get Robotic Process Automation now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.