Week 9 - Testing
May 3, 2025
Hi.
Progress this week was a bit slow. I once again read more papers to prepare for writing my research paper. I talked about that last week and there wasn’t much new so I won’t bore you with the details.
More importantly, I finally was able to do some practical testing on AI models myself on Chatbot Arena. Two weeks ago, I talked about the different methods of jailbreaking LLMs, such as Do Anything Now (DAN), Greedy Coordinate Gradient (GCG), multishot, etc. This week I put some of them into practice.
Using prompts collected from a red-teaming database, each of which represented a different approach to jailbreaking, I tested them for a variety of LLMs that were available on Chatbot Arena. The data I collect from this will be used to conduct a comparative analysis on the weaknesses of different AI models in my final product. I won’t spoil the results now though.
That’s all. Snabal bob shilzibwibel.

Leave a Reply
You must be logged in to post a comment.