Kunal Pai committed on
Commit
58408af
·
1 Parent(s): adfd14d

Memory function description added to paper

paper/conference_101719.tex CHANGED
@@ -161,7 +161,7 @@ Explicit resource management is central, moving beyond simple API cost tracking.
  \item \textbf{Memory Usage:} Footprint of active Employee agents (\% of allocated budget).
  \item \textbf{Agent Concurrency:} Count of concurrently active agents.
  \end{itemize}
- Metrics are monitored against predefined \textbf{budget limits}. Actions (like hiring) exceeding limits (e.g., >90\% memory, exceeding max concurrency) are prevented. This ensures operation within constraints, crucial for limited resources or strict budgets.
+ Metrics are monitored against predefined \textbf{budget limits}. Actions (like hiring) exceeding limits (e.g., $>$90\% memory, exceeding max concurrency) are prevented. This ensures operation within constraints, crucial for limited resources or strict budgets.
 
  \subsection{Tool Utilization and Autonomous Creation}
  \label{subsec:tooling}
@@ -176,6 +176,13 @@ A distinctive feature is \textbf{integrated, autonomous tool creation}. If the C
  \end{enumerate}
  This allows HASHIRU to dynamically extend its functional repertoire, tailoring capabilities to tasks without manual intervention, enabling greater autonomy and adaptation.
 
+ \subsection{Memory Function: Learning from Experience}
+ \label{subsec:memory}
+
+ To enable HASHIRU agents to learn from past interactions and rectify previous errors, a \textbf{Memory Function} is incorporated. This function stores records of significant past events, particularly those involving failed attempts or suboptimal outcomes, acting as a historical log of experiences. When the system encounters a new problem or a recurring challenge, it queries this memory store to retrieve relevant past situations and their outcomes.
+
+ Memory retrieval is based on semantic similarity between the current context (e.g., task description, recent actions, error messages) and the stored memory entries. We utilize embeddings generated by the \textbf{all-MiniLM-L6-v2} model \cite{wang2020minilmdeepselfattentiondistillation} to represent both the query and the stored memories in a high-dimensional vector space. Relevance is determined by calculating the \textbf{cosine similarity} between the query embedding and each memory embedding. Memories exceeding a predefined similarity threshold are retrieved and provided to the CEO agent (or relevant Employee agents) as contextual information. This allows the system to draw upon past experiences, understand why previous approaches failed, and potentially adjust its strategy to avoid repeating mistakes, thereby improving performance and efficiency over time.
+
  \section{Experimental Setup}
  \label{sec:experiments}
 
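Note on the new Memory Function subsection: the threshold-based retrieval it describes can be illustrated with a minimal sketch. This is not the HASHIRU implementation. The function names, the threshold of 0.8, and the three-dimensional toy vectors are all assumptions made for illustration; in the paper the vectors would be all-MiniLM-L6-v2 embeddings, and the actual threshold value is not stated in the diff.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as unrelated to everything
    return dot / (norm_a * norm_b)


def retrieve_memories(query_vec, memories, threshold=0.8):
    """Return (score, text) pairs scoring above threshold, best match first.

    `threshold` is a hypothetical default, not a value from the paper.
    """
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in memories]
    hits = [(score, text) for score, text in scored if score > threshold]
    return sorted(hits, key=lambda pair: pair[0], reverse=True)


# Toy 3-dimensional stand-ins for real sentence embeddings (hypothetical values).
memories = [
    ("Tool call failed: missing API key", [1.0, 0.0, 0.0]),
    ("Hiring blocked: memory budget exceeded", [0.0, 1.0, 0.0]),
    ("Tool call failed: rate limit hit", [0.9, 0.1, 0.0]),
]
query = [1.0, 0.05, 0.0]  # stands in for the embedded current context

results = retrieve_memories(query, memories)
for score, text in results:
    print(f"{score:.3f}  {text}")
```

With these toy vectors, the two "tool call failed" memories clear the threshold and the unrelated budget memory is filtered out. A real system would likely precompute and store the memory embeddings rather than re-embed them on every query.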
paper/references.bib CHANGED
@@ -19,6 +19,16 @@
  year={2024}
  }
 
+ @misc{wang2020minilmdeepselfattentiondistillation,
+ title={MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers},
+ author={Wenhui Wang and Furu Wei and Li Dong and Hangbo Bao and Nan Yang and Ming Zhou},
+ year={2020},
+ eprint={2002.10957},
+ archivePrefix={arXiv},
+ primaryClass={cs.CL},
+ url={https://arxiv.org/abs/2002.10957},
+ }
+
  @misc{gemini25flash,
  title = {Gemini 2.5 Flash: Model Card, API, and Announcement},
  author = {{Google DeepMind} and {Google AI}},