Structuring Text Data for Reliable AI Pulls

Summary

In CommunityOne Builders Help this announcement covers how to structure long txt data for multiple pulls to improve consistency and reduce AI hallucinations. It recommends limiting double newlines so embeddings keep related content together, notes markdown import newline issues, and asks about using agents for processing.

youngnoodle-35 OP

For long data structures that will have multiple pulls, how should I be structuri g the data? This is an example of my current structure

youngnoodle-35 OP

I'm wanting the layout and data to remain consistent, however I noticed that the Ai sometimes hallucinates

happybeacon-37

huh, this one i am not sure, i know in previous version, when we were using chatgpt, having it in paragrah works well because this is how we chunk the content, but any thoughts on this

sharpquartz-72

Is this a common ocurrence? I see that you have a lot of text lines separated with double newlines like this:


### Alt-Fire

**Damage:** impact: 58.2, slash: 29.1, puncture: 9.7

Crit Chance: 21.00% | Crit Multiplier: 2.3x | Status Chance: 33.00% | Fire Rate: 1.5

### Alt-Fire Explosion

**Damage:** blast: 789

Crit Chance: 21.00% | Crit Multiplier: 2.3x | Status Chance: 33.00% | Fire Rate: 1.5

We do use double newlines to separate chunks of text for our vector embeddings. So maybe keeping related content separated with a single newline at most would help, like this:

### Alt-Fire
**Damage:** impact: 58.2, slash: 29.1, puncture: 9.7
Crit Chance: 21.00% | Crit Multiplier: 2.3x | Status Chance: 33.00% | Fire Rate: 1.5

### Alt-Fire Explosion
**Damage:** blast: 789
Crit Chance: 21.00% | Crit Multiplier: 2.3x | Status Chance: 33.00% | Fire Rate: 1.5
youngnoodle-35 OP

I'll try that. Thank you

youngnoodle-35 OP

It was separated because I was using an MD conversion from an import, I just forgot to account for new lines

happybeacon-37

btw, on a seperate note, are you familar with agents?

youngnoodle-35 OP

I haven't used specific ones

The latest from CommunityOne Builders Help

Most Used Discord Bots in CommunityOne

Thread exploring the most popular server bots for staff and members, from Saphire and Tatsu to QOTD and Mee6. Share what your community uses

Help Needed Setting Up Custom Quest

Troubleshoot custom quest setup, External URL questions, and save issues in CommunityOne Builders Help.