Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
seanpedrickcase
/
document_redaction
like
5
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
f188b10
document_redaction
/
tools
676 kB
3 contributors
History:
151 commits
seanpedrickcase
Corrected an issue with finding valid language entities for AWS comprehend redaction
f188b10
about 1 month ago
__init__.py
Safe
0 Bytes
Initial commit
over 1 year ago
auth.py
Safe
2.46 kB
Added compatibility with gradio_image_annotation for passing through id and text properties to annotator. Corrected csv location for Textract api calls. Other minor changes
5 months ago
aws_functions.py
Safe
9.51 kB
Updated duplicate pages functionality. Improve redaction efficiency a little with concat method. Minor modification to documentation and interface
3 months ago
aws_textract.py
Safe
18.3 kB
Added support for other languages. Improved DynamoDB download
about 1 month ago
cli_redact.py
Safe
9.08 kB
Updated command line redaction script with more options
about 1 month ago
config.py
22.2 kB
Corrected Tesseract data location
about 1 month ago
custom_csvlogger.py
Safe
12.9 kB
Updated packages. Corrected CSV logger headings, can now submit custom log csv names to S3. Started work on identifying and deduplicating at the line level
3 months ago
custom_image_analyser_engine.py
68.7 kB
Corrected an issue with finding valid language entities for AWS comprehend redaction
about 1 month ago
data_anonymise.py
43.1 kB
Updated command line redaction script with more options
about 1 month ago
example_cli_calls.txt
Safe
769 Bytes
Repaired Dockerfile hopefully finally
about 1 month ago
file_conversion.py
109 kB
Updated command line redaction script with more options
about 1 month ago
file_redaction.py
139 kB
Updated command line redaction script with more options
about 1 month ago
find_duplicate_pages.py
58.5 kB
Added support for other languages. Improved DynamoDB download
about 1 month ago
helper_functions.py
Safe
30.2 kB
Added support for other languages. Improved DynamoDB download
about 1 month ago
load_spacy_model_custom_recognisers.py
24.6 kB
Corrected Tesseract data location
about 1 month ago
presidio_analyzer_custom.py
Safe
4.92 kB
More config options. Fixed some bugs with removing elements from review page and Adobe export. Some UI rearrangements
6 months ago
redaction_review.py
94.7 kB
Updated command line redaction script with more options
about 1 month ago
textract_batch_call.py
Safe
28 kB
Expanded checks for out of range page cropboxes
4 months ago