★ Mr.Pimp · Methodology

How we test

Last updated: June 1, 2026 · 9 min read

How Mr.Pimp's reviews are produced — from app discovery to publication. Every step documented so you can verify our work.

The short version: Each app is tested for 5-15 hours across multiple sessions. Five categories scored: Dialogue (30%), NSFW (25%), Privacy (15%), Customization (15%), Value (15%). Affiliate status has zero weight in scoring. Every review is re-verified every 3-6 months.

1. App discovery

We discover apps to test through:

Reader submissions via the contact form — about 40% of our test queue.
App store monitoring — iOS App Store and Google Play AI-companion category, monthly sweeps.
Search trend data — what people are searching for in the AI companion space.
Industry monitoring — funding announcements, product launches, beta lists.
Competitor mentions — apps mentioned in other reviews we trust.

Affiliate program existence is NOT a criterion for testing. If an app is popular or interesting, we test it whether or not we can monetize the review.

2. Testing protocol

Every app gets the same testing protocol:

Session 1 (1-2 hours): First impressions

Sign up with a fresh account (separate email, separate payment method)
Complete onboarding without skipping
Try 3-5 default companions for ~20 messages each
Test claimed free tier features end-to-end
Note: signup friction, paywalls, dark patterns

Session 2-3 (2-4 hours): Depth testing

Pick one companion. Have an extended conversation (100+ messages).
Test memory across context: "What did I tell you about [X]?"
Test character consistency under stress (sarcasm, contradiction, emotional pivot).
Test NSFW capability across the spectrum (vanilla, edge cases, refusals)
Test customization depth (personality, appearance, voice, kinks)

Session 4+ (2-5 hours): Feature breadth

Test image generation (count seconds, count successful renders, note quality)
Test voice features (synthesis quality, latency, NSFW behavior)
Test mobile apps if available
If paywalled features matter, upgrade to Premium for at least one month

Session 5+ (1-2 hours): Receipts

Document exact hours spent, messages sent, dollars spent
Screenshot key interactions for the review
Cancel or downgrade subscription if no longer needed

3. Scoring system

Every app is scored 0-10 in five categories. Final score is a weighted average:

Dialogue (30%): Character consistency, conversational depth, memory, handling of edge cases.
NSFW capability (25%): Uncensored quality, image generation, voice features. SFW-only apps score 0 here.
Privacy (15%): Encryption, deletion policy, third-party sharing, transparency reports.
Customization (15%): Depth of character creation, personality controls, kink/preference settings.
Value (15%): Real monthly cost (including tokens/extras) vs feature delivery.

Affiliate program: 0% weight. An app's commission rate has no influence on its score.

4. What the scores mean

9.0-10: Best in class. Genuinely worth your time and money. Editor's pick territory.
8.0-8.9: Very good. Some weaknesses but strong overall delivery.
7.0-7.9: Good with notable gaps. Worth considering if specific strengths match your needs.
6.0-6.9: Mediocre. Better options exist unless one specific feature is decisive.
5.0-5.9: Poor. Significant problems. Not recommended.
Below 5.0: Avoid. Either misleading marketing, broken product, or harmful practices.

5. Transparency requirements

Every Mr.Pimp review publishes:

Exact hours tested — visible in the article header.
Exact messages sent — visible in the article header.
Exact dollars spent — visible in the article header.
Test log — dates of original test and all re-tests, shown in sidebar.
Verdict bullets — three quick takes (positive + warnings) for skimmability.
Pros and cons — explicit, specific, no hedging.
Hot take — the editor's straight-talk paragraph that summarizes the verdict in plain language.

6. Re-testing cadence

Reviews are re-verified every 3-6 months. We check:

Pricing changes (token costs, subscription tier prices)
Feature additions or removals
Companion library changes
Privacy policy changes
Major UX changes

If significant changes warrant rescoring, we re-test fully. Re-test count is shown on every review ("3× re-verified").

7. Corrections and disputes

If you find an error in a review, email corrections@mr-pimp.com. We respond to all credible correction requests within 7 days. Major corrections are flagged with a public changelog entry on the affected review.

If you are an app vendor: we accept correction requests on factual matters (pricing, features, etc.) immediately. We do not accept requests to alter editorial opinion, score, or framing.

8. What can change a score

Score changes between editions happen because of:

Product changes: features added or removed, pricing changes, paywall additions or removals.
New competitive context: if a new app raises the bar in a category, existing apps may be rescored relative to the new ceiling.
Methodology refinements: we occasionally adjust weightings or add subcategories. When this happens, all affected reviews are rescored and changelog entries note the methodology shift.

Score changes are never driven by affiliate program changes.

9. Affiliate disclosure

Outgoing links on Mr.Pimp routed through /go/{vendor}/ are affiliate links when the vendor has a program. We earn a commission when readers sign up via these links. The commission has no influence on rankings.

Apps without affiliate programs are linked using /go/{vendor}/ redirects too, but those redirects are not monetized.

The rel="sponsored noopener" HTML attribute is applied to affiliate links per Google's outbound link guidelines.

Want us to test a specific app?

Email contact@mr-pimp.com with the app name and URL. We add ~3-5 apps to the test queue per month.