Spaces:

anjali2002
/

UtensilDetector

Sleeping

+# 🍽️ Utensils Object Detection System
+Welcome to **Utensils Object Detection System** — an end-to-end pipeline that detects Utensils items like plates, glasses, spoons, and forkss using a custom-trained deep learning model.
+This project was built **from scratch** (no Roboflow or auto-annotation tools!) and demonstrates a full lifecycle: dataset creation, model training, performance evaluation, and an interactive demo app.
+---
+## 🏗️ Project Overview
+We set out to solve a real-world problem:
+> _“Can we reliably detect common Utensils items in images, videos, or real-time webcam streams using only a small, custom-labeled dataset?”_
+To achieve this, we:
+✅ Collected & annotated a custom dataset (100–500 images)
+✅ Built a clean Python codebase to handle training, inference, and deployment
+✅ Delivered an interactive demo using **Streamlit / Flask**
+---
+## 📁 Project Structure
+```
+├── app/                # Streamlit or Flask app for demo
+│   └── app.py
+├── dataset/            # Custom dataset (images + labels)
+│   ├── images/
+│   └── labels/
+├── inference/          # Inference scripts (image, video, webcam)
+│   ├── detect_image.py
+│   ├── detect_video.py
+│   └── detect_webcam.py
+├── runs/detect/        # Training results & saved weights
+│   ├── weights/
+│   ├── results.png
+│   └── Other Metrics ...
+├── training/           # Training pipeline
+│   ├── train.py
+│   └── model_training.ipynb
+├── data.yaml           # Dataset config
+├── requirements.txt    # Python dependencies
+└── README.md           # This file
+```
+---
+## 🗂️ Dataset
+- **Images collected:** Manually photographed or sourced from public domain (Kaggle)
+- **Classes:** Example — plate, fork, spoon, glass
+- **Annotation tool:** [LabelImg](https://github.com/heartexlabs/labelImg)
+- **Format:** YOLO txt labels
+---
+## 🏋️‍♂️ Model Training
+- **Framework:** YOLOv8
+- **Training script:** `training/train.py`
+- **Best checkpoint:** `runs/detect/weights/best.pt`
+- **Metrics logged:** loss curves, mAP, precision, recall, F1
+---
+## 🔍 Inference & Results
+- Run detection on:
+  - Static images → `inference/detect_image.py`
+  - Video files → `inference/detect_video.py`
+  - Real-time webcam → `inference/detect_webcam.py`
+- Visual outputs include:
+  - Bounding boxes with class names and confidence
+  - Confusion matrix
+  - Precision-recall, F1 curves
+---
+## 🌐 Interactive Demo
+Launch the demo app:
+```bash
+pip install -r requirements.txt
+streamlit run app/app.py
+```
+Features:
+- Upload image or video and get detections
+- View predicted bounding boxes + class names + confidence scores
+- (Optional) Real-time webcam support
+---
+## 🚀 Getting Started
+1️⃣ Clone the repo:
+```bash
+git clone https://github.com/yourusername/Utensils-object-detection.git
+cd Utensils-object-detection
+```
+2️⃣ Install dependencies:
+```bash
+pip install -r requirements.txt
+```
+3️⃣ Run training:
+```bash
+python training/train.py --data data.yaml
+```
+4️⃣ Try inference:
+```bash
+python inference/detect_image.py --source path/to/image.jpg
+```
+5️⃣ Launch app:
+```bash
+streamlit run app/app.py
+```
+Model summary (fused): 92 layers, 25,842,076 parameters, 0 gradients, 78.7 GFLOPs
+                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 3/3 [00:02<00:00,  1.48it/s]
+                   all         40         40      0.681      0.725      0.731      0.468
+                  fork         10         10      0.338        0.2      0.265      0.113
+                 glass         10         10      0.643        0.9      0.888      0.432
+                 plate         10         10          1          1      0.995      0.833
+                 spoon         10         10      0.744        0.8      0.776      0.496
+---
+## 📊 Performance
+| Metric         | Value    |
+|---------------|----------|
+| mAP@0.5       | 78.0%    |
+| mAP@0.5:0.95  | 50.8%    |
+| Precision     | 85.5%    |
+| Recall        | 67.5%    |
+> _These numbers are based on our custom dataset; actual results may vary depending on data size and quality._
+---
+## 💡 Challenges & Learnings
+- **Challenge:** Small dataset size → risk of overfitting
+- **Solution:** Data augmentation and careful validation splitting
+- **Challenge:** Labeling errors → noisy annotations
+- **Solution:** Manual re-checking of all labels
+- **Challenge:** Real-time inference speed
+- **Solution:** Optimized image preprocessing pipeline
+---
+## 🛡️ License & Acknowledgments
+- Built using open-source tools: [Ultralytics YOLO](https://github.com/ultralytics/yolov5), [Streamlit](https://streamlit.io/)
+- Dataset annotated manually, no pre-annotated sources used
+- No external pre-trained models on non-custom data
+---
+If you like this project, ⭐ the repo and feel free to contribute!
+Happy detecting! 🍳🍴🥄

app/app.py ADDED Viewed

	@@ -0,0 +1,136 @@

+import streamlit as st
+from PIL import Image
+import cv2
+import numpy as np
+from ultralytics import YOLO
+import tempfile
+import os
+from streamlit.web.bootstrap import run
+# Disable file watcher
+os.environ["STREAMLIT_SERVER_ENABLE_FILE_WATCHER"] = "false"  # Disable watcher
+os.environ["STREAMLIT_SERVER_ENABLE_XSRF_PROTECTION"] = "false"
+# Configure environment to suppress warnings
+os.environ['TF_CPP_MIN_LOG_LEVEL'] = '3'
+os.environ["STREAMLIT_WATCHER_TYPE"] = "none"
+# Load model with caching
+@st.cache_resource
+def load_model():
+    return YOLO("best50.pt")
+# Initialize session state for webcam
+if 'webcam_active' not in st.session_state:
+    st.session_state.webcam_active = False
+# App title and layout
+st.title("Object Detection App")
+st.write("Upload an image/video or use your webcam")
+# Load model
+model = load_model()
+# Create tabs for different input sources
+tab_upload, tab_webcam = st.tabs(["Upload Media", "Webcam"])
+def process_image(img):
+    """Process image and return annotated version with results"""
+    img_array = np.array(img)
+    img_bgr = cv2.cvtColor(img_array, cv2.COLOR_RGB2BGR)
+    results = model(img_bgr, conf=0.5, iou=0.4)
+    annotated_img = results[0].plot()
+    return cv2.cvtColor(annotated_img, cv2.COLOR_BGR2RGB), results
+with tab_upload:
+    uploaded_file = st.file_uploader(
+        "Choose an image or video",
+        type=["jpg", "jpeg", "png", "mp4", "mov"],
+        label_visibility="collapsed"
+    )
+    if uploaded_file:
+        if uploaded_file.type.startswith('image'):
+            # Process image
+            image = Image.open(uploaded_file)
+            annotated_img, results = process_image(image)
+            # Display side by side
+            col1, col2 = st.columns(2)
+            with col1:
+                st.image(image, caption="Original Image", use_container_width=True)
+            with col2:
+                st.image(annotated_img, caption="Detected Objects", use_container_width=True)
+            # Show detection results
+            st.subheader("Detected Objects:")
+            for box in results[0].boxes:
+                class_name = model.names[int(box.cls)]
+                confidence = float(box.conf)
+                if confidence >= 0.5:
+                    st.write(f"- {class_name} (confidence: {confidence:.2f})")
+        elif uploaded_file.type.startswith('video'):
+            # Process video
+            with tempfile.NamedTemporaryFile(delete=False, suffix='.mp4') as tfile:
+                tfile.write(uploaded_file.read())
+                video_path = tfile.name
+            st.video(video_path)
+            # Process and show output video
+            with st.spinner('Processing video...'):
+                cap = cv2.VideoCapture(video_path)
+                fps = cap.get(cv2.CAP_PROP_FPS)
+                width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
+                height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
+                output_path = tempfile.NamedTemporaryFile(delete=False, suffix='.mp4').name
+                out = cv2.VideoWriter(output_path, cv2.VideoWriter_fourcc(*'mp4v'), fps, (width, height))
+                while cap.isOpened():
+                    ret, frame = cap.read()
+                    if not ret:
+                        break
+                    results = model(frame)
+                    annotated_frame = results[0].plot()
+                    out.write(annotated_frame)
+                cap.release()
+                out.release()
+                st.video(output_path)
+                os.unlink(video_path)
+                os.unlink(output_path)
+with tab_webcam:
+    if st.checkbox("Start Webcam", key="webcam_toggle"):
+        st.session_state.webcam_active = True
+        st.write("Click below to stop the webcam")
+        cap = cv2.VideoCapture(0)
+        frame_placeholder = st.empty()
+        stop_button = st.button("Stop Webcam")
+        while cap.isOpened() and st.session_state.webcam_active:
+            ret, frame = cap.read()
+            if not ret or stop_button:
+                st.session_state.webcam_active = False
+                break
+            results = model(frame)
+            annotated_frame = results[0].plot()
+            frame_placeholder.image(
+                cv2.cvtColor(annotated_frame, cv2.COLOR_BGR2RGB),
+                channels="RGB",
+                use_container_width=True
+            )
+            if stop_button:
+                st.session_state.webcam_active = False
+                break
+        cap.release()
+        if stop_button:
+            st.success("Webcam stopped")

data.yaml ADDED Viewed

	@@ -0,0 +1,10 @@

+# kitchen.yaml
+path: dataset
+train: images/train
+val: images/test
+names:
+  0: fork
+  1: glass
+  2: plate
+  3: spoon

inference/detect_image.py ADDED Viewed

	@@ -0,0 +1,29 @@

+from ultralytics import YOLO
+import cv2
+import sys
+# Check for command-line argument
+if len(sys.argv) < 2:
+    print("Usage: python detect_image.py <image_path>")
+    sys.exit(1)
+image_path = sys.argv[1]
+# Load trained model
+model = YOLO("runs/detect/weights/best.pt")
+# Read image
+frame = cv2.imread(image_path)
+if frame is None:
+    print(f"Error: Could not read image at {image_path}")
+    sys.exit(1)
+# Run detection with conf and iou threshold
+results = model(frame, conf=0.5, iou=0.4, imgsz=640, augment=False)
+# Plot and save results
+annotated_img = results[0].plot()
+cv2.imwrite("output.jpg", annotated_img)
+cv2.imshow("Detection", annotated_img)
+cv2.waitKey(0)
+cv2.destroyAllWindows()

inference/detect_video.py ADDED Viewed

	@@ -0,0 +1,19 @@

+from ultralytics import YOLO
+import cv2
+model = YOLO("runs/detect/weights/best.pt")
+video_path = "test_video.mp4"
+cap = cv2.VideoCapture(video_path)
+while cap.isOpened():
+    ret, frame = cap.read()
+    if not ret: break
+    results = model(frame, conf=0.5, imgsz=640, augment=False)
+    annotated_frame = results[0].plot()
+    cv2.imshow("Video Detection", annotated_frame)
+    if cv2.waitKey(1) == ord('q'): break
+cap.release()
+cv2.destroyAllWindows()

inference/detect_webcam.py ADDED Viewed

	@@ -0,0 +1,18 @@

+from ultralytics import YOLO
+import cv2
+model = YOLO("runs/detect/weights/best.pt")
+cap = cv2.VideoCapture(0)  # 0 = default webcam
+while cap.isOpened():
+    ret, frame = cap.read()
+    if not ret: break
+    results = model(frame, conf=0.5, imgsz=640, augment=False)
+    annotated_frame = results[0].plot()
+    cv2.imshow("Webcam Detection", annotated_frame)
+    if cv2.waitKey(1) == ord('q'): break
+cap.release()
+cv2.destroyAllWindows()

input.jpg ADDED Viewed

Git LFS Details

SHA256: 4e49a16225bf40b05c434052e42b1066098d4c0fee218d94bc40ed5395bf5825
Pointer size: 131 Bytes
Size of remote file: 317 kB

output.jpg ADDED Viewed

Git LFS Details

SHA256: ef3856797e6f675bc39f0a8f8d29b6a461ed3ae04797058a74b7129873dc1e37
Pointer size: 132 Bytes
Size of remote file: 9.58 MB

requirements.txt ADDED Viewed

Binary file (2.29 kB). View file

runs/detect/F1_curve.png ADDED Viewed

Git LFS Details

SHA256: 32a466ab5d303000016fea2df5cb527af531e9e50c57d571b51fceedd59c53ed
Pointer size: 131 Bytes
Size of remote file: 153 kB

runs/detect/PR_curve.png ADDED Viewed

Git LFS Details

SHA256: d6113a211bfbd1e1f036674ec0a1ac54a9fd87f5ae056ab02a1add0057c8c5b6
Pointer size: 131 Bytes
Size of remote file: 156 kB

runs/detect/P_curve.png ADDED Viewed

Git LFS Details

SHA256: c2c55e5582ccdaec1c2b771d51054f64adfe3efd3567385599efa54c16f2208c
Pointer size: 131 Bytes
Size of remote file: 136 kB

runs/detect/R_curve.png ADDED Viewed

Git LFS Details

SHA256: 3991ddc82f0e50723ecbbe4089d403a5235c985f162eed8a438039298741d0ae
Pointer size: 131 Bytes
Size of remote file: 148 kB

runs/detect/args.yaml ADDED Viewed

	@@ -0,0 +1,105 @@

+task: detect
+mode: train
+model: yolov8m.pt
+data: data.yaml
+epochs: 45
+time: null
+patience: 20
+batch: 8
+imgsz: 640
+save: true
+save_period: 10
+cache: false
+device: null
+workers: 8
+project: null
+name: detector2
+exist_ok: false
+pretrained: true
+optimizer: auto
+verbose: true
+seed: 0
+deterministic: true
+single_cls: false
+rect: false
+cos_lr: true
+close_mosaic: 10
+resume: false
+amp: true
+fraction: 1.0
+profile: false
+freeze: null
+multi_scale: false
+overlap_mask: true
+mask_ratio: 4
+dropout: 0.0
+val: true
+split: val
+save_json: false
+conf: 0.25
+iou: 0.7
+max_det: 300
+half: false
+dnn: false
+plots: true
+source: null
+vid_stride: 1
+stream_buffer: false
+visualize: false
+augment: true
+agnostic_nms: false
+classes: null
+retina_masks: false
+embed: null
+show: false
+save_frames: false
+save_txt: false
+save_conf: false
+save_crop: false
+show_labels: true
+show_conf: true
+show_boxes: true
+line_width: null
+format: torchscript
+keras: false
+optimize: false
+int8: false
+dynamic: false
+simplify: true
+opset: null
+workspace: null
+nms: false
+lr0: 0.001
+lrf: 0.01
+momentum: 0.937
+weight_decay: 0.0005
+warmup_epochs: 3.0
+warmup_momentum: 0.8
+warmup_bias_lr: 0.1
+box: 7.5
+cls: 0.5
+dfl: 1.5
+pose: 12.0
+kobj: 1.0
+nbs: 64
+hsv_h: 0.015
+hsv_s: 0.7
+hsv_v: 0.4
+degrees: 0.0
+translate: 0.1
+scale: 0.5
+shear: 0.0
+perspective: 0.0
+flipud: 0.0
+fliplr: 0.5
+bgr: 0.0
+mosaic: 1.0
+mixup: 0.0
+cutmix: 0.0
+copy_paste: 0.0
+copy_paste_mode: flip
+auto_augment: randaugment
+erasing: 0.4
+cfg: null
+tracker: botsort.yaml
+save_dir: runs/detect/detector2

runs/detect/confusion_matrix.png ADDED Viewed

Git LFS Details

SHA256: e7036f2784deb31f752740ce589364166c5b1f0de93c30ab35d997d1a6d74ca9
Pointer size: 131 Bytes
Size of remote file: 111 kB

runs/detect/confusion_matrix_normalized.png ADDED Viewed

Git LFS Details

SHA256: be8be4842215da5fd9ea1ddffe0cac936da688bf21367434d003cc8fa63ba00f
Pointer size: 131 Bytes
Size of remote file: 131 kB

runs/detect/labels.jpg ADDED Viewed

Git LFS Details

SHA256: 9f67a1fa5ec886e603e276f31b76de993fa5bc4094a6769a82cc39edd96d7f0c
Pointer size: 131 Bytes
Size of remote file: 177 kB

runs/detect/results.csv ADDED Viewed

	@@ -0,0 +1,46 @@

+epoch,time,train/box_loss,train/cls_loss,train/dfl_loss,metrics/precision(B),metrics/recall(B),metrics/mAP50(B),metrics/mAP50-95(B),val/box_loss,val/cls_loss,val/dfl_loss,lr/pg0,lr/pg1,lr/pg2
+1,7.75734,1.78036,3.49556,2.01507,0.76609,0.325,0.39437,0.18047,2.10203,10.7005,2.18361,0.0002375,0.0002375,0.0002375
+2,15.423,1.79302,2.75923,1.85474,0.10526,0.2,0.06603,0.02793,2.87958,125.644,3.0908,0.000486912,0.000486912,0.000486912
+3,22.8104,1.94713,2.68974,1.95075,0.01597,0.05,0.01305,0.00572,3.23687,10.9756,4.06396,0.000733947,0.000733947,0.000733947
+4,29.5551,1.9183,2.58979,1.92766,0.00299,0.35,0.00262,0.00088,3.16908,659.959,5.41089,0.000976818,0.000976818,0.000976818
+5,36.9361,1.96711,2.706,2.00498,0.00173,0.075,0.00086,0.00028,3.3475,inf,5.57994,0.00121377,0.00121377,0.00121377
+6,43.6503,1.95869,2.6115,1.95863,0.00156,0.125,0.00094,0.00018,4.31249,inf,24.4792,0.00121268,0.00121268,0.00121268
+7,51.1376,1.96479,2.48776,2.07338,0.25637,0.15,0.00609,0.00189,3.0645,57.6018,4.1836,0.00119651,0.00119651,0.00119651
+8,58.2241,1.94281,2.37012,2.00123,0.00126,0.075,0.00072,0.00024,3.37767,99.9235,4.77505,0.00117757,0.00117757,0.00117757
+9,65.4209,1.88106,2.32684,1.99259,0.03353,0.25,0.03108,0.01404,2.58351,18.2153,2.76314,0.00115598,0.00115598,0.00115598
+10,72.8397,2.012,2.37121,1.9462,0.26343,0.1,0.01193,0.00264,2.86945,180.343,3.25406,0.00113183,0.00113183,0.00113183
+11,79.8439,1.86117,2.06471,1.84808,0.0904,0.1,0.05516,0.02857,2.77258,15.9436,3.18565,0.00110524,0.00110524,0.00110524
+12,91.7099,1.70296,2.0259,1.78789,0.6563,0.2,0.25778,0.0981,2.25306,4.36774,2.63195,0.00107634,0.00107634,0.00107634
+13,99.195,1.80577,2.01447,1.80319,0.46786,0.46085,0.402,0.13204,2.43435,3.47326,2.56699,0.00104527,0.00104527,0.00104527
+14,106.573,1.7805,2.02751,1.76633,0.36397,0.35,0.35821,0.13186,2.49812,3.8955,2.61041,0.00101219,0.00101219,0.00101219
+15,114.187,1.79277,1.87168,1.75102,0.47327,0.325,0.40062,0.20824,2.21791,2.76894,2.2815,0.000977251,0.000977251,0.000977251
+16,126.712,1.86401,2.22416,1.87106,0.34464,0.125,0.24458,0.11386,2.61298,3.79657,2.39263,0.000940625,0.000940625,0.000940625
+17,134.111,1.79036,2.00106,1.73161,0.46931,0.275,0.24856,0.12475,2.3213,4.52619,2.22079,0.000902492,0.000902492,0.000902492
+18,141.24,1.71544,1.96843,1.69301,0.34963,0.41783,0.40704,0.24095,1.95451,2.74367,1.92154,0.000863038,0.000863038,0.000863038
+19,153.147,1.6654,1.83614,1.66695,0.62241,0.3,0.31528,0.13595,2.29612,4.0426,2.32851,0.000822454,0.000822454,0.000822454
+20,160.716,1.5989,1.68419,1.66531,0.47069,0.15,0.19331,0.08635,2.4992,3.49593,2.48693,0.000780939,0.000780939,0.000780939
+21,167.696,1.71097,1.80517,1.67909,0.4244,0.6,0.49734,0.24719,1.87494,2.0764,2.03337,0.000738695,0.000738695,0.000738695
+22,182.047,1.63031,1.75042,1.66501,0.50595,0.55,0.53611,0.28425,1.93364,2.10613,2.06664,0.000695927,0.000695927,0.000695927
+23,193.937,1.67486,1.71635,1.6886,0.69444,0.375,0.57381,0.36873,1.94534,2.2838,2.0227,0.000652844,0.000652844,0.000652844
+24,201.92,1.58261,1.65864,1.6307,0.44792,0.375,0.46229,0.32119,1.85533,2.04479,1.97446,0.000609656,0.000609656,0.000609656
+25,209.38,1.59317,1.66635,1.604,0.51972,0.675,0.68986,0.40141,1.68553,1.80119,1.80373,0.000566573,0.000566573,0.000566573
+26,222.711,1.60071,1.60139,1.66942,0.61703,0.625,0.68858,0.4174,1.5544,1.79489,1.74148,0.000523805,0.000523805,0.000523805
+27,237.258,1.54978,1.67005,1.59467,0.6375,0.575,0.67006,0.41696,1.72084,1.65729,1.83008,0.000481561,0.000481561,0.000481561
+28,244.715,1.54939,1.57594,1.64142,0.65936,0.7,0.71757,0.43896,1.65917,1.61329,1.78028,0.000440046,0.000440046,0.000440046
+29,252.703,1.46182,1.5317,1.55495,0.54613,0.65,0.66185,0.43019,1.55526,1.46552,1.74899,0.000399462,0.000399462,0.000399462
+30,260.256,1.44586,1.57444,1.52382,0.66958,0.65,0.69523,0.40794,1.60213,1.43563,1.84778,0.000360008,0.000360008,0.000360008
+31,267.214,1.47418,1.5533,1.51499,0.74268,0.75,0.77259,0.43295,1.6962,1.4413,1.86702,0.000321875,0.000321875,0.000321875
+32,280.385,1.51454,1.51662,1.56526,0.76953,0.75,0.79032,0.47132,1.66749,1.42884,1.80474,0.000285249,0.000285249,0.000285249
+33,294.255,1.40159,1.45345,1.47245,0.78169,0.67774,0.7599,0.48878,1.65617,1.48197,1.76941,0.000250309,0.000250309,0.000250309
+34,306.081,1.45523,1.42562,1.51033,0.8331,0.71207,0.76877,0.50295,1.6183,1.45446,1.74654,0.000217225,0.000217225,0.000217225
+35,314.158,1.51728,1.4611,1.53849,0.83402,0.67335,0.7337,0.49892,1.61208,1.43519,1.75238,0.000186158,0.000186158,0.000186158
+36,322.657,1.52999,1.56233,1.7411,0.75,0.65,0.72847,0.47787,1.61425,1.37957,1.73973,0.00015726,0.00015726,0.00015726
+37,329.571,1.513,1.49616,1.71639,0.94097,0.65,0.80974,0.53109,1.67459,1.40905,1.73769,0.000130671,0.000130671,0.000130671
+38,343.315,1.48147,1.45212,1.7249,0.94097,0.675,0.82224,0.4982,1.68408,1.39715,1.73625,0.00010652,0.00010652,0.00010652
+39,350.613,1.45414,1.34207,1.67427,0.87401,0.675,0.80025,0.49424,1.64481,1.33722,1.73205,8.49262e-05,8.49262e-05,8.49262e-05
+40,358.163,1.40402,1.34877,1.71041,0.86985,0.675,0.77389,0.48173,1.62943,1.30801,1.72339,6.59937e-05,6.59937e-05,6.59937e-05
+41,365.091,1.3864,1.35002,1.63678,0.86862,0.675,0.7746,0.47976,1.59893,1.29388,1.71103,4.98152e-05,4.98152e-05,4.98152e-05
+42,377.869,1.35166,1.30164,1.64118,0.90471,0.675,0.78746,0.50376,1.5849,1.27825,1.69229,3.64693e-05,3.64693e-05,3.64693e-05
+43,385.314,1.37277,1.2453,1.62867,0.89678,0.675,0.78457,0.49722,1.57498,1.2606,1.67731,2.60212e-05,2.60212e-05,2.60212e-05
+44,392.27,1.41298,1.27781,1.64009,0.86004,0.675,0.79572,0.50276,1.5718,1.25428,1.67299,1.85216e-05,1.85216e-05,1.85216e-05
+45,399.724,1.37738,1.28799,1.61474,0.85493,0.675,0.7802,0.50775,1.56201,1.26131,1.67392,1.40072e-05,1.40072e-05,1.40072e-05

runs/detect/results.png ADDED Viewed

Git LFS Details

SHA256: 610b9989dfa77643a858cada7677eb29297fe8bbfb1b6653fff1eca2a33ac248
Pointer size: 131 Bytes
Size of remote file: 307 kB

training/model_training.ipynb ADDED Viewed

The diff for this file is too large to render. See raw diff

training/train.py ADDED Viewed

	@@ -0,0 +1,21 @@

+from ultralytics import YOLO
+#base model
+model = YOLO("yolov8m.pt")  #for better accuracy
+model.train(
+    data="data.yaml",
+    epochs=45,                # increased to allow better convergence
+    patience=20,               # early stopping if no val improvement
+    imgsz=640,                 # image size FOR ACCURACY
+    batch=8,
+    conf=0.25,                 # initial THRESHOLD
+    name="detector",
+    augment=True,              # enables data augmentation
+    auto_augment='randaugment',# advanced augmentation
+    lr0=0.001,                 # initial learning rate
+    cos_lr=True,               # cosine learning rate schedule (smoother training)
+    save=True,
+    save_period=10,            # save weights every 10 epochs
+)