Synopsis Report
Synopsis Report
Synopsis Report
ON
FileFlow
Submitted in partial fulfilment of the requirements for the award of the degree
of
BACHELOR OF TECHNOLOGY
in
by
Guided by
Assistant Professor
DELHI – 110058
2020-2024
TABLE OF CONTENTS
Abstract …………………………………………………………………………...… 1
Introduction ………………………………………………………………………… 2
Objective ……………………………………………………………………….…… 4
Limitations …………………………………………………………………………...6
References ………………………………………………...………………………….7
ABSTRACT
In the realm of Artificial Intelligence, our project, FileFlow aims to introduce a Large
Language Model (LLM) based system designed to enhance conversational AI. While
our primary focus is on enabling meaningful interactions, we also aim to integrate the
capability to navigate through a diverse range of document formats. This synthesis
focuses on providing users with a versatile AI assistant that caters to both casual
conversations and document-related tasks.
This report delves into the cutting-edge technologies driving a paradigm shift in
information interaction, transcending traditional document management approaches.
Leveraging advanced natural language processing (NLP) algorithms, users can engage
in dynamic conversations with their files, extracting insights, asking questions, and
gaining a deeper understanding of their content.
Our project aims to develop a user-friendly tool called FileFlow, designed to facilitate
easy communication and comprehension of various file formats such as documents,
presentations, spreadsheets, and more. This project will serve as a conversational
platform, allowing individuals to interact with their files as if they were chatting with
a knowledgeable friend.
The primary objective is to create a seamless experience where users can engage in
natural conversations with their documents, asking questions, seeking clarification,
and extracting relevant information effortlessly. We want to eliminate the complexity
typically associated with navigating and understanding diverse file types, making it
accessible to anyone, regardless of technical expertise.
Additionally, this project will incorporate features that promote casual conversation
and learning, including the ability to chat with an AI companion. This AI friend will
offer assistance, provide explanations, and engage users in friendly discussions,
making the process of working with documents both enjoyable and educational.
A chatbot is an artificially intelligent creature which can converse with humans. This
could be text-based, or a spoken conversation (in case of voice-based queries). Chat
bots are basically used for information acquisition. It can run on the local PCs and
mobile phones, though most of the time it is accessed through the internet. It is a
conversational agent which interacts with users in a certain domain or on a particular
topic with input in natural language sentences. Mainly a chatbot works by a user
asking some question or initiating a new topic of discussion.[5] a paper by Tarun
Lalwani, Shashank Bhalotia conveyed Chat bots can be referred as software agents
that pretend as human entity. These are the agents with AI embedded and using NLP
they can answer to user questions. Predefined knowledge base helps develop a
response to the query [6].
Chatbot design [7] typically relies on parsing techniques, pattern matching strategies
and Natural Language Processing (NLP) to represent the chatbot knowledge. The
latter is the dominant technique thanks to the popularization of libraries and cloud-
based services such as DialogFlow [8] or IBM Watson Assistant [9], which rely on
Machine 4 Learning (ML) techniques to understand the user input (based on a set of
training sentences provided as part of the chatbot definition) and provide user-friendly
interfaces to design the conversational flow.
This project isn't about grandiosity but rather about creating a tool that seamlessly
integrates into users' workflows, providing practical utility in both casual
conversations and document-related tasks.
METHODOLOGY OF STUDY
1. Natural Language Processing Accuracy: While the project aims to enable natural
conversations with documents, the accuracy of understanding and responding to user
queries in natural language is a significant challenge. Natural Language Processing
(NLP) technologies, though advanced, may struggle with interpreting complex,
ambiguous, or contextually rich queries, leading to inaccurate or irrelevant responses.
2. Data Privacy and Security: Given that users will interact with personal and
potentially sensitive documents, ensuring data privacy and security is paramount. The
system must securely handle, process, and store documents to protect against
unauthorized access, data breaches, or leaks. Building user trust requires rigorous data
protection measures and transparency about how data is used and safeguarded.
4. Website Hyperlink Parsing Limitation: When interacting with websites, the system
may encounter limitations in parsing hyperlinks within the page content. Parsing only
the specified link provided by the user may restrict the depth of content analysis and
limit the system's ability to access additional relevant information or context from
linked pages.
[1] Tarun Lalwani, Shashank Bhalotia, Ashish Pal, Shreya Bisen, Vasundhara Rathod,
IEEE International Conference on Computational Systems and Information
Technology for Sustainable Solutions 2017.
[3] Amey Tiwari, Rahul Talekar, Prof. S. M. Patil, “College Information Chatbot
System”, International Journal of Engineering Research and General Science, Volume
2, Issue 2, April 2017.
[4] C. M. Bishop. Pattern recognition and machine learning. Springer New York, 2006
[6] Kreimeyer K., Foster M., Pandey A., Arya N., Halford G., Jones S.F., Forshee R.,
Walderhaug M., Botsis T. Natural language processing systems for capturing and
standardizing unstructured clinical information: A systematic review. J. Biomed.
Inform. 2017;73:14–29. doi: 10.1016/j.jbi.2017.07.012. [PMC free article] [PubMed]
[CrossRef] [Google Scholar]
[7] Chowdhury G.G. Natural language processing. Annu. Rev. Inf. Sci.
Technol. 2003;37:51–89. doi: 10.1002/aris.1440370103. [CrossRef] [Google Scholar]
[8] Choudhary N (2021) LDC-IL: the Indian repository of resources for language
technology. Lang Resources & Evaluation 55:855–867. [google scholar]
[10] Baud RH, Rassinoux AM, Scherrer JR (1992) Natural language processing and
semantical representation of medical texts. Methods Inf Med 31(2):117–125