Structuring Text Data for Reliable AI Pulls

Summary

In CommunityOne Builders Help this announcement covers how to structure long txt data for multiple pulls to improve consistency and reduce AI hallucinations. It recommends limiting double newlines so embeddings keep related content together, notes markdown import newline issues, and asks about using agents for processing.

Original Post

For long data structures that will have multiple pulls, how should I be structuri g the data? This is an example of my current structure

Reply

I'm wanting the layout and data to remain consistent, however I noticed that the Ai sometimes hallucinates

Reply

huh, this one i am not sure, i know in previous version, when we were using chatgpt, having it in paragrah works well because this is how we chunk the content, but any thoughts on this

Reply

Is this a common ocurrence? I see that you have a lot of text lines separated with double newlines like this:


### Alt-Fire

**Damage:** impact: 58.2, slash: 29.1, puncture: 9.7

Crit Chance: 21.00% | Crit Multiplier: 2.3x | Status Chance: 33.00% | Fire Rate: 1.5

### Alt-Fire Explosion

**Damage:** blast: 789

Crit Chance: 21.00% | Crit Multiplier: 2.3x | Status Chance: 33.00% | Fire Rate: 1.5

We do use double newlines to separate chunks of text for our vector embeddings. So maybe keeping related content separated with a single newline at most would help, like this:

### Alt-Fire
**Damage:** impact: 58.2, slash: 29.1, puncture: 9.7
Crit Chance: 21.00% | Crit Multiplier: 2.3x | Status Chance: 33.00% | Fire Rate: 1.5

### Alt-Fire Explosion
**Damage:** blast: 789
Crit Chance: 21.00% | Crit Multiplier: 2.3x | Status Chance: 33.00% | Fire Rate: 1.5
Reply

I'll try that. Thank you

Reply

It was separated because I was using an MD conversion from an import, I just forgot to account for new lines

Reply

btw, on a seperate note, are you familar with agents?

Reply

I haven't used specific ones

The latest from CommunityOne Builders Help

24/7 Stage VC Radio Community Vibe

CommunityOne Builders Help turns Stage VC into a 24/7 radio bot so members can study, work, or chill together to shared music.

Server-to-Server Help Hub Proposal

Proposed forum to connect server owners and mods: share ideas, find partners, recruit mods, and showcase features.