Commit 77f0110 (parent: a7585d3): Update README with 300M capability and clearer description

README.md (changed):
---
emoji: ⚡
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.39.0
app_file: app.py
pinned: false
---

# FlexChunk: Enabling Massive Out-of-Core SpMV

This interactive demo showcases **FlexChunk**, an algorithm for performing Sparse Matrix-Vector Multiplication (SpMV) on matrices too large to fit in RAM.

**Key Benefit:** Process matrices up to 300M×300M using only ~1.7 GB RAM by dividing them into manageable horizontal chunks.

## Interactive Demo

The application above provides two modes:

- **Standard Mode**: For matrices up to 200K×200K, with optional SciPy comparison
- **Advanced Mode**: For matrices up to 300M×300M, using FlexChunk only

You can adjust parameters including:

- Matrix size
- Density
- Number of chunks
- Matrix type (standard, or challenging with extreme values)
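To illustrate how these parameters interact, here is a hypothetical stand-in for the demo's matrix generator (not the Space's actual code), built with SciPy; the `size` and `density` values play the role of the controls listed above:

```python
import numpy as np
import scipy.sparse as sp

# Hypothetical stand-in for the demo's generator: size and density
# correspond to the "Matrix size" and "Density" parameters above.
size, density = 1_000, 0.001
rng = np.random.default_rng(0)
A = sp.random(size, size, density=density, format="csr", random_state=rng)

# A "challenging" matrix mixes extreme magnitudes into the stored values.
A.data *= rng.choice([1e-12, 1.0, 1e12], size=A.data.shape)

print(A.shape, A.nnz)  # nnz is roughly density * size**2
```

At realistic demo scales the density must be tiny: the 1.0M × 1.0M benchmark below has only 1.2M non-zeros, a density of about 1.2e-6.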

## Performance Highlights

FlexChunk demonstrates near-linear scaling in both time and memory usage:

| Matrix Size     | Non-zero Elements | Total Time   | Peak RAM Usage |
|-----------------|-------------------|--------------|----------------|
| 1.0M × 1.0M     | 1.2M              | 1.07 s       | 17.00 MB       |
| 10.0M × 10.0M   | 12.0M             | 10.21 s      | 170.00 MB      |
| 50.0M × 50.0M   | 62.5M             | 55.27 s      | 850.00 MB      |
| 100.0M × 100.0M | 120.0M            | 1 min 47.1 s | 1.70 GB        |

Our algorithm scales linearly to even larger matrices (up to 300M×300M), with proportional increases in processing time and memory.
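A quick arithmetic check on the benchmark table above (plain Python, using only the published numbers) shows why the scaling is called near-linear: the time per million rows stays roughly constant across a 100× range of sizes.

```python
# (rows, total seconds) taken from the benchmark table above;
# 1 min 47.1 s = 107.1 s.
benchmarks = [(1.0e6, 1.07), (10.0e6, 10.21), (50.0e6, 55.27), (100.0e6, 107.1)]

# Near-linear time scaling means seconds per million rows is ~constant.
per_million = [t / (n / 1e6) for n, t in benchmarks]
print([round(v, 3) for v in per_million])  # → [1.07, 1.021, 1.105, 1.071]
```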

## How It Works

FlexChunk operates in three main stages:

1. **Matrix Division**: Splits the matrix into horizontal chunks and saves them to disk
2. **Sequential Processing**: Loads one chunk at a time to minimize memory usage
3. **Result Accumulation**: Combines partial results into the final vector

This approach makes it possible to multiply vectors with matrices that would otherwise exceed available RAM.
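The three stages above can be sketched in a few lines of SciPy. This is an illustrative toy, not the actual FlexChunk implementation; the chunking scheme, file layout, and function name are assumptions made for the example:

```python
import os
import tempfile

import numpy as np
import scipy.sparse as sp

def flexchunk_spmv(A, x, n_chunks, workdir):
    """Toy sketch of the three stages described above."""
    m = A.shape[0]
    bounds = np.linspace(0, m, n_chunks + 1, dtype=int)

    # Stage 1: Matrix Division - split into horizontal (row-band) chunks
    # and save each one to disk.
    paths = []
    for i in range(n_chunks):
        path = os.path.join(workdir, f"chunk_{i}.npz")
        sp.save_npz(path, A[bounds[i]:bounds[i + 1]].tocsr())
        paths.append(path)

    # Stages 2-3: Sequential Processing and Result Accumulation -
    # load one chunk at a time; each chunk produces one slice of y.
    y = np.empty(m)
    for i, path in enumerate(paths):
        chunk = sp.load_npz(path)  # only this chunk is held in RAM
        y[bounds[i]:bounds[i + 1]] = chunk @ x
    return y

A = sp.random(1000, 1000, density=0.01, format="csr", random_state=0)
x = np.ones(1000)
with tempfile.TemporaryDirectory() as d:
    y = flexchunk_spmv(A, x, n_chunks=4, workdir=d)
assert np.allclose(y, A @ x)
```

Because each row band contributes a disjoint slice of the output, the partial results can simply be written into place, and peak memory is bounded by the largest single chunk rather than the whole matrix.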

## Links

- **Source Code**: [GitHub Repository](https://github.com/DanielSwift1992/FlexChunk)
- **Full Article**: [FlexChunk: Enabling 100M×100M Out-of-Core SpMV](https://www.lesswrong.com/posts/zpRhsdDkWygTDScxb/flexchunk-enabling-100m-100m-out-of-core-spmv-1-8-min-1-7-gb)