Spaces:
Running
Running
Kunal Pai
commited on
Commit
·
58408af
1
Parent(s):
adfd14d
Memory function description added to paper
Browse files- paper/conference_101719.tex +8 -1
- paper/references.bib +10 -0
paper/conference_101719.tex
CHANGED
@@ -161,7 +161,7 @@ Explicit resource management is central, moving beyond simple API cost tracking.
|
|
161 |
\item \textbf{Memory Usage:} Footprint of active Employee agents (\% of allocated budget).
|
162 |
\item \textbf{Agent Concurrency:} Count of concurrently active agents.
|
163 |
\end{itemize}
|
164 |
-
Metrics are monitored against predefined \textbf{budget limits}. Actions (like hiring) exceeding limits (e.g.,
|
165 |
|
166 |
\subsection{Tool Utilization and Autonomous Creation}
|
167 |
\label{subsec:tooling}
|
@@ -176,6 +176,13 @@ A distinctive feature is \textbf{integrated, autonomous tool creation}. If the C
|
|
176 |
\end{enumerate}
|
177 |
This allows HASHIRU to dynamically extend its functional repertoire, tailoring capabilities to tasks without manual intervention, enabling greater autonomy and adaptation.
|
178 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
179 |
\section{Experimental Setup}
|
180 |
\label{sec:experiments}
|
181 |
|
|
|
161 |
\item \textbf{Memory Usage:} Footprint of active Employee agents (\% of allocated budget).
|
162 |
\item \textbf{Agent Concurrency:} Count of concurrently active agents.
|
163 |
\end{itemize}
|
164 |
+
Metrics are monitored against predefined \textbf{budget limits}. Actions (like hiring) exceeding limits (e.g., $>$90\% memory, exceeding max concurrency) are prevented. This ensures operation within constraints, crucial for limited resources or strict budgets.
|
165 |
|
166 |
\subsection{Tool Utilization and Autonomous Creation}
|
167 |
\label{subsec:tooling}
|
|
|
176 |
\end{enumerate}
|
177 |
This allows HASHIRU to dynamically extend its functional repertoire, tailoring capabilities to tasks without manual intervention, enabling greater autonomy and adaptation.
|
178 |
|
179 |
+
\subsection{Memory Function: Learning from Experience}
|
180 |
+
\label{subsec:memory}
|
181 |
+
|
182 |
+
To enable HASHIRU agents to learn from past interactions and rectify previous errors, a \textbf{Memory Function} is incorporated. This function stores records of significant past events, particularly those involving failed attempts or suboptimal outcomes, acting as a historical log of experiences. When the system encounters a new problem or a recurring challenge, it queries this memory store to retrieve relevant past situations and their outcomes.
|
183 |
+
|
184 |
+
Memory retrieval is based on semantic similarity between the current context (e.g., task description, recent actions, error messages) and the stored memory entries. We utilize embeddings generated by the \textbf{all-MiniLM-L6-v2} model \cite{wang2020minilmdeepselfattentiondistillation} to represent both the query and the stored memories in a high-dimensional vector space. Relevance is determined by calculating the \textbf{cosine similarity} between the query embedding and each memory embedding. Memories exceeding a predefined similarity threshold are retrieved and provided to the CEO agent (or relevant Employee agents) as contextual information. This allows the system to draw upon past experiences, understand why previous approaches failed, and potentially adjust its strategy to avoid repeating mistakes, thereby improving performance and efficiency over time.
|
185 |
+
|
186 |
\section{Experimental Setup}
|
187 |
\label{sec:experiments}
|
188 |
|
paper/references.bib
CHANGED
@@ -19,6 +19,16 @@
|
|
19 |
year={2024}
|
20 |
}
|
21 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
22 |
@misc{gemini25flash,
|
23 |
title = {Gemini 2.5 Flash: Model Card, API, and Announcement},
|
24 |
author = {{Google DeepMind} and {Google AI}},
|
|
|
19 |
year={2024}
|
20 |
}
|
21 |
|
22 |
+
@misc{wang2020minilmdeepselfattentiondistillation,
|
23 |
+
title={MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers},
|
24 |
+
author={Wenhui Wang and Furu Wei and Li Dong and Hangbo Bao and Nan Yang and Ming Zhou},
|
25 |
+
year={2020},
|
26 |
+
eprint={2002.10957},
|
27 |
+
archivePrefix={arXiv},
|
28 |
+
primaryClass={cs.CL},
|
29 |
+
url={https://arxiv.org/abs/2002.10957},
|
30 |
+
}
|
31 |
+
|
32 |
@misc{gemini25flash,
|
33 |
title = {Gemini 2.5 Flash: Model Card, API, and Announcement},
|
34 |
author = {{Google DeepMind} and {Google AI}},
|