awacke1 commited on
Commit
4bda7c1
·
verified ·
1 Parent(s): 90bb105

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +58 -0
README.md CHANGED
@@ -9,6 +9,64 @@ app_file: app.py
9
  pinned: false
10
  license: mit
11
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  # 7/9 - evaluate new GPT models
13
 
14
  GPT-4o Documentation: https://cookbook.openai.com/examples/gpt4o/introduction_to_gpt4o
 
9
  pinned: false
10
  license: mit
11
  ---
12
+
13
+
14
+ | 🧩 **Category** | **Info** |
15
+ |---------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------|
16
+ | 🛠️ **Libraries & Modules Used** | **Web & UI**: streamlit, streamlit.components.v1 |
17
+ | | **AI & API Integration**: openai, gradio_client |
18
+ | | **File Handling**: base64, os, glob, zipfile, textract, PyPDF2 |
19
+ | | **Image & Video Processing**: cv2, moviepy, PIL |
20
+ | | **Text Processing**: re, BeautifulSoup, pandas |
21
+ | | **Time & Date**: datetime, pytz |
22
+ | | **Utilities & Concurrency**: concurrent.futures, ThreadPoolExecutor, tqdm |
23
+ | | **Speech & Audio**: audio_recorder_streamlit |
24
+ | ⚙️ **App Configuration** | **Site Name**: Scholarly-Article-Document-Search-With-Memory |
25
+ | | **Page Title/Icon**: 🔬🧠ScienceBrain.AI, custom icon file (icons.ico) |
26
+ | | **Sidebar**: Save session checkbox (should_save) |
27
+ | 🗣️ **Core Functionalities** | **Text Interaction**: |
28
+ | | - Chat-based prompts using GPT-4o. |
29
+ | | - Saves conversations as Markdown files (md). |
30
+ | | - Speech Synthesis (HTML5) |
31
+ | | - Embedded JavaScript function (SpeechSynthesis) to read aloud content. |
32
+ | | **Image Interaction**: |
33
+ | | - Upload image, base64 encode, and analyze via GPT-4o. |
34
+ | | - Results stored as Markdown (md) with filenames including prompts and timestamps. |
35
+ | | **Audio Interaction**: |
36
+ | | - Upload or record audio. |
37
+ | | - Audio transcribed using Whisper, summarized or analyzed via GPT-4o, and responses stored as Markdown files. |
38
+ | | **Video Interaction**: |
39
+ | | - Extracts frames and audio from videos. |
40
+ | | - Transcribes audio track, summarizes via GPT-4o. |
41
+ | | - Markdown files created with summarized content. |
42
+ | 📚 **Advanced Document Handling** | **Vector Stores & PDF Galleries**: |
43
+ | | - Allows upload of multiple PDF files, generating quizzes, summaries, or key facts. |
44
+ | | - Vector stores created and managed for RAG querying. |
45
+ | | - Arxiv scholarly search integration through Hugging Face Gradio API (awacke1/Arxiv-Paper-Search-And-QA-RAG-Pattern). |
46
+ | | **RAG (Retrieval-Augmented Generation)**: |
47
+ | | - Performs semantic search on uploaded PDFs. |
48
+ | | - Evaluates retrieval performance metrics (recall@k, mrr, map). |
49
+ | 🗃️ **File Management** | **Sidebar File Manager**: |
50
+ | | - Filter files (.md, .pdf, .png, etc.). |
51
+ | | - Operations: View, Edit, Download, Run, Delete individual or bulk actions. |
52
+ | | - Zip download functionality for filtered files. |
53
+ | 🛠️ **Helper Functions & Utilities** | **Filename Generation (generate_filename)**: |
54
+ | | - Date/time, prompt sanitized for filesystem. |
55
+ | | **File Saving (create_and_save_file)**: |
56
+ | | - Conditional file writing based on checkbox option. |
57
+ | | **Concurrent PDF uploads**: |
58
+ | | - Managed via ThreadPoolExecutor with progress tracking. |
59
+ | 🎨 **UI Enhancements** | - Chat-style user-assistant interactions. |
60
+ | | - Streamlit's columns for organized interface and clean button prompts. |
61
+ | | - Dynamic HTML and JavaScript embeds for rich user interaction. |
62
+ | 🌐 **Environmental Variables** | - **Required variables**: |
63
+ | | - API_KEY |
64
+ | | - HF_KEY (Hugging Face) |
65
+ | | - OPENAI_API_KEY |
66
+ | | - OPENAI_ORG_ID |
67
+ | 📝 **Note** | The app robustly integrates multimodal capabilities (text, audio, video, images) and leverages OpenAI and Hugging Face resources extensively to provide a rich, interactive AI-driven document management and analysis experience. |
68
+
69
+
70
  # 7/9 - evaluate new GPT models
71
 
72
  GPT-4o Documentation: https://cookbook.openai.com/examples/gpt4o/introduction_to_gpt4o