LLM Evaluation of Medical Videos

This folder contains code and scripts for using large language models (LLMs) in evluating the credibility of YouTube medical videos. Evaluation is done on the transcripts of the videos, which was generated using this script.

Subfolders

Claude_Evaluation
Gemini_Evaluation
HuggingFace_Evaluation
OpenAI_Evaluation
Results_Analysis

Contains scripts for analyzing the results of LLM evaluations:
- all_LLMs_plots_and_statistics.ipynb
  - Script for generating plots and statistics for evaluations of all LLMs.
- analyse_LLMs_responses.ipynb
  - Script for analyzing the responses of each LLM.
- one_LLM_plots_and_statistics.ipynb
  - Script for generating plots and statistics for a single LLM.
- statistics_plots_analysis_utils.py
  - Utility functions for generating plots and analyzing statistics.

Files

llm_evaluation_utils.py
- Utility functions for LLM evaluation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Evaluation of Medical Videos

Subfolders

Files

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Claude_Evaluation		Claude_Evaluation
Gemini_Evaluation		Gemini_Evaluation
HuggingFace_Evaluation		HuggingFace_Evaluation
OpenAI_Evaluation		OpenAI_Evaluation
Results_Analysis		Results_Analysis
.gitignore		.gitignore
README.md		README.md
llm_evaluation_utils.py		llm_evaluation_utils.py

madatr/LLM-Interfacing

Folders and files

Latest commit

History

Repository files navigation

LLM Evaluation of Medical Videos

Subfolders

Files

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages