RepoAmazon (Nova)Amazon (Nova)published Nov 11, 2025seen 5d

amazon-science/semantic-volume

Python

Open original ↗

Captured source

source ↗
published Nov 11, 2025seen 5dcaptured 10hhttp 200method plain

amazon-science/semantic-volume

Language: Python

License: Apache-2.0

Stars: 0

Forks: 0

Open issues: 0

Created: 2025-11-11T20:43:24Z

Pushed: 2025-11-11T20:52:07Z

Default branch: main

Fork: no

Archived: no

README:

Using Semantic Volume to Detect both External and Internal Uncertainty of LLMs

🎉 Our paper has been accepted to [AAAI 2026]!

Studying external and internal uncertainty of LLMs. This repository provides the necessary code for running the Semantic Volume method for both external uncertainty detection (query ambiguity) and internal uncertainty detection (response uncertainty) of LLMs.

External Uncertainty:

The original clamber data can be downloaded here https://github.com/zt991211/CLAMBER. The necessary code for query augmentation and embedding generation are provided in extend_questions.py and generate_embeddings.py.

The code to run the Semantic Volume calculation for query ambiguity detection is in detect_query_ambiguity.py.

Internal Uncertainty:

Please put the original Trivia10K data (10K subset of the original TriviaQA data: https://nlp.cs.washington.edu/triviaqa/) in a data folder. The necessary code to sample candidate responses and embedding generation is provided in sample_llama_answers.py.

The code to run the Semantic Volume calculation for response uncertainty detection is in detect_response_uncertainty.py.

Security

See [CONTRIBUTING](CONTRIBUTING.md#security-issue-notifications) for more information.

License

This project is licensed under the Apache-2.0 License.

Notability

notability 3.0/10

Routine research code release