Week 9 - Testing

May 3, 2025

Hi.

Progress this week was a bit slow. I once again read more papers to prepare for writing my research paper. I talked about that last week and there wasn’t much new so I won’t bore you with the details.

More importantly, I finally was able to do some practical testing on AI models myself on Chatbot Arena. Two weeks ago, I talked about the different methods of jailbreaking LLMs, such as Do Anything Now (DAN), Greedy Coordinate Gradient (GCG), multishot, etc. This week I put some of them into practice.

Using prompts collected from a red-teaming database, each of which represented a different approach to jailbreaking, I tested them for a variety of LLMs that were available on Chatbot Arena. The data I collect from this will be used to conduct a comparative analysis on the weaknesses of different AI models in my final product. I won’t spoil the results now though.

That’s all. Snabal bob shilzibwibel.

View more of Vincent Y.'s posts.

Week 9 - Testing

Reader Interactions

Leave a Reply Cancel reply