Spaces:

Agents-MCP-Hackathon
/

rss-mcp-server

Runtime error

gperdrizet commited on Jun 5

Commit

e6f6cfa

unverified ·

1 Parent(s): e97f932

Added fence to prevent parsing empty HTML string.

Files changed (1) hide show

functions/helper_functions.py CHANGED Viewed

@@ -209,6 +209,9 @@ def get_html(url: str) -> str:
                 content = content.decode(encoding)
     except HTTPError:
         content = None
@@ -227,6 +230,9 @@ def get_text(html: str) -> str:
     Returns:
         Cleaned text string'''
     extractor = extractors.ArticleExtractor()
@@ -236,6 +242,11 @@ def get_text(html: str) -> str:
     except HTMLExtractionError:
         pass
     return clean_html(html)

                 content = content.decode(encoding)
+            else:
+                content = None
     except HTTPError:
         content = None
     Returns:
         Cleaned text string'''
+    if html is None:
+        return None
     extractor = extractors.ArticleExtractor()
     except HTMLExtractionError:
         pass
+    except AttributeError:
+        pass
+    except TypeError:
+        pass
     return clean_html(html)