2025-03-31 FaradAI Dataset V0

 

1. Summary

Total unfiltered videos: 184162, 3.4T (currently still downloading more) Number of filtered videos: 10141, 150GB (5% usable so far) Containing people: 3183 (~30%)

 

image-20250331123434804

 

image-20250331130027291

image-20250331125810945

 

image-20250331132916307

 

image-20250331144900264

 

 

2. Labelling methodology

  1. Extract 3 frames (beginning 10%, middle, end 10%)

  2. Military Object Detector based on Yolo11 trained on

    1. Datasets:

      1. https://data.mendeley.com/datasets/njdjkbxdpn/1

      2. https://universe.roboflow.com/zeki-8vreq/vehicle-a5vis/browse?queryText=&pageSize=50&startingIndex=0&browseQuery=true

      3. https://universe.roboflow.com/lee1/hj-hnab7/browse?queryText=&pageSize=50&startingIndex=0&browseQuery=true

      4. https://universe.roboflow.com/dk-uun3s/danger-fxawo

      5. https://universe.roboflow.com/ships-kev8a/military-ship-detection/browse?queryText=&pageSize=50&startingIndex=0&browseQuery=true

      6. https://universe.roboflow.com/hanif-noer-r/military-ships/browse?queryText=&pageSize=50&startingIndex=0&browseQuery=true

      7. https://universe.roboflow.com/seaobjects-2rxjz/8_final

      8. https://universe.roboflow.com/ttest/bb-ajv15

      9. https://universe.roboflow.com/object-detection-1y7d0/military-aircraft-iaddd/browse?queryText=&pageSize=50&startingIndex=0&browseQuery=true

      10. https://universe.roboflow.com/aiden-lee-roboflow/aircraft-detection-thesis/browse?queryText=&pageSize=50&startingIndex=0&browseQuery=true

      11. https://universe.roboflow.com/planes-zmdv1/aircraft-classification-2/browse?queryText=class:commercial&pageSize=50&startingIndex=0&browseQuery=true

    2. Classes 0: person 1: military vehicle 2: civilian vehicle 3: military aircraft 4: civilian aircraft 5: military watercraft 6: civilian watercraft

  3. vit-gpt2-image-captioning -> Zero-shot image captioning

  4. LLM2CLIP-Llama-3-8B-Instruct-CC-Finetuned -> Predefined text labels to match images (https://share.yellowrobot.xyz/quick/2025-4-24-4CD8B1B3-8572-409B-ACB9-5E8A4644C92F.zip)

  5. Keyword filtering using manually crafted algorithm

  6. Manual filtering using manual labeller

 

3. Examples

Some of examples of dataset, each video contains 3 labelled frames, defaced (blured) video and JSON label file. Below are randomly selected samples from the dataset!

3.1. Military Watercraft: Ship

apwagner_7846_0943136f-b229-42f7-8ce9-ee6ed778691a.mp4

 

boris_rozhin_92803_42b7975a-6784-4690-aff8-3b267a777445.MP4

boris_rozhin_141267_4a5d4536-6352-4bda-9205-e1347f6a500c.mp4

3.2. Military Watercraft: Submarine

boris_rozhin_48823_9f7938c6-3856-4366-80cd-9e085f2b0198.mp4

 

3.3. Military Vehicle: APC

ab3army_780_49507586-cb0d-4437-8d6f-82da4f3c7c3e.MOV

apwagner_21098_624d67a3-70c4-4658-a371-77ce6710334d.mp4

boris_rozhin_33060_1d15c89b-07d9-4d1b-81af-141bedaf599c_defaced.mp4

3.4. Military Vehicle: Tank

wargonzo_21991_7161e01a-ee8b-43ef-a078-836fc3c7ff13_defaced.mp4

 

RVvoenkor_82008_01aee5e5-fd03-41f3-9c95-310b00db2cd7.mp4

 

3.5. Military Vehicle: Multiple launch rocket system

RVvoenkor_18612_0c0966a3-c0d8-4e81-a24c-cfd0c89fd975.mp4

apwagner_10044_c0796b28-39cc-45c9-972e-3af40987985c.mp4

 

3.6. Military aircraft: fighter jet

apwagner_16988_be589a28-c472-4684-ab3e-576b6a7b4276.mp4

boris_rozhin_34403_de178228-2da9-4fbb-bc06-7c27da3a0e29.mp4

boris_rozhin_32147_f4928272-3568-4fee-9bf3-aa7181d88bab.mp4

 

3.7. Military aircraft: helicopter

boris_rozhin_35111_27b7f0bd-2f84-45d5-b47a-34703afd6f55.mp4

boris_rozhin_34808_08ce9d86-8187-46e5-9edd-806d7464b919.mp4

apwagner_31792_529b364b-cda7-4d72-8ae0-aa48a77c1a7d_defaced.mp4

 

3.8. Military aircraft: Drones and drone footage

ab3army_3123_af378ed7-7663-45af-88a6-4e2f2dff84a4.MP4

boris_rozhin_92736_0997a867-c9ec-42b7-bf4f-8e7761ba502d.MP4

3.9. Infrared footage

apwagner_10471_9ac97b63-d006-45f5-9431-feb58418f424.mp4

boris_rozhin_123417_b1d66471-f4d5-44f9-8e15-0ca05b88623a.mp4

 

4. Further work

Currently still scraping data from military conflicts in Ukraine and Middle-East. Improving quality of the dataset by manual filtering and testing out multi-modal Gemma 3, Qwen 2.5 VL, Llama 4 image captioning capabilities. Also it has been planned to test instead of LLM2CLIP other embbeding based mathing models which do not have blocked military content like JinaCLIP, CLIPBert and others.