What if the AI gets something wrong?

Fair worry — and the honest answer is, sometimes it will. Less often than the wild AI demos you've probably seen, but never zero. Every AI feature we ship is grounded in your real information so it's answering from facts instead of guessing. For high-stakes tasks (quotes that go to customers, contracts, money) we add a step where a person reviews before anything goes out. The skill is catching mistakes before your customer ever sees them.

Who pays for the AI usage — me or you?

You do, but it's billed directly to your OpenAI or Anthropic account, not ours. We don't resell AI usage or take a markup on what the AI costs to run. You see exactly what every interaction costs in your own dashboard, and if we ever part ways the account and any unused credit stays with you.

What does this actually cost to run per month?

For a small AI feature like a chatbot, usually $5–$50 per month in AI usage. For a bigger AI assistant reading through your documents, typically $30–$300 per month depending on how much your team or customers use it. We turn on a behind-the-scenes trick called prompt caching by default that drops the cost of repeat questions by 50–80%. You get a usage dashboard from day one so nothing surprises you.

What about the privacy of my data?

Big concern, totally understandable. Short version: your data stays yours. The two main AI providers we use (OpenAI and Anthropic) have contracts saying they don't use your data to train their models when accessed through their business APIs. Your data lives in your accounts, not a third-party AI service. For regulated industries (healthcare, finance, legal), we can also use Azure OpenAI or Amazon Bedrock for extra contractual coverage that satisfies HIPAA and similar requirements.

Claude, GPT, Gemini, Llama — which do you actually use?

All of the major ones — the answer really is whichever is best for the specific job. Claude (from Anthropic) is our default for careful writing, reasoning, and using business tools. GPT (from OpenAI) is great for fast, cheap classification work at high volume. Gemini (from Google) wins when there's a huge amount of context to process or when your business already lives in Google Workspace. Llama (from Meta) is the right pick when you need to run the AI on your own servers, especially in regulated industries. We write everything so the AI provider can be swapped with a single configuration change — when the next better model comes out, you upgrade without a rewrite.

आपके बिज़नेस के लिए AI

AI जो आपका actually समय बचाए — एक और tool नहीं जो आपको सीखना पड़े।

आपको एक और ChatGPT subscription नहीं चाहिए. आपको ऐसा AI चाहिए जो उस specific चीज़ को सँभाले जो आपके हफ़्ते को खा रही है — quotes draft करना, वही customer question 40वीं बार answer करना, phone उठाने से पहले client की history summarize करना. उन tools में बना हुआ जो आप पहले से इस्तेमाल करते हैं, आपके data में grounded, ऐसी कीमत पर जहाँ गणित सच में काम करता है. $500 से शुरू.

मुफ़्त डिस्कवरी कॉल बुक करें पैकेज देखें

कोई demo theatre नहीं — बस कुछ ऐसा जो अपनी कीमत निकाले.

क्या यह आपके लिए है?

क्या AI वाकई आपके बिज़नेस के लिए मायने रखता है?

सीधी बात, शुरू से.

AI शायद अपनी कीमत निकालेगा अगर…

आप वही task बार-बार कर रहे हैं — quotes draft करना, वही question reply करना, invoices sort करना — और यह हर हफ़्ते घंटे खा रहा है.
आपके पास data का ढेर है (orders, tickets, emails, PDFs, call notes) जिसे पढ़ने का किसी के पास समय नहीं, पर उसमें answers दबे हैं.
आपकी टीम हर दिन ग्राहकों से वही 10 सवाल झेलती है, और आपकी website उन्हें answer कर सकती थी अगर वह जानती कि कैसे.
आप एक और व्यक्ति hire किए बिना ज़्यादा काम लेना चाहते हैं जो उसके साथ आने वाला admin सँभाले.
आपने ख़ुद ChatGPT से खेला है और सोचा "इसे मेरे असली बिज़नेस में wire करने का कोई तरीक़ा होना चाहिए, अलग browser tab नहीं."

शायद रुक जाइए अगर…

असली bottleneck ये है कि अभी आपके पास data ही नहीं है। आपकी real information के बिना AI चीज़ें बना देता है — तो बेहतर पहला step है आपके data को एक जगह लाना। अगर useful हो तो उस हिस्से में मदद करने में ख़ुशी होगी, फिर हम बाद में AI पर वापस आते हैं।
आप "एक AI" चाहते हैं पर अभी किसी specific task पर उँगली नहीं रख सकते जो वो handle करे। पूरी तरह common है — AI अभी हर जगह है। Fix आसान है: अपनी plate पर एक specific, repetitive चीज़ सोचिए (quotes draft करना, वही question reply करना, paperwork sort करना) और उसे call पर लाइए। हम एक real problem का एक अच्छा answer बनाएंगे।
आपकी team ने process को पहले से एक spreadsheet में solve कर लिया है और वो genuinely काम करता है। अपना पैसा बचाइए — अगर हमें call पर यही मिले, तो हम आपको यही बताएंगे।
आप AI को ज़्यादातर इसलिए consider कर रहे हैं क्योंकि ये अभी trendy है। ये पूरी तरह समझ में आता है, पर एक beat के लायक है: customers आमतौर पर परवाह करते हैं कि आप phone उठाते हैं या नहीं, scenes के पीछे क्या चल रहा है ये नहीं। अगर हम कोई specific real problem ढूँढ सकें जिसे AI solve करेगा, बढ़िया। अगर नहीं, तो वो भी एक fine answer है।

पक्का नहीं? कॉल बुक करें →

अगर आप पक्का नहीं हैं आप किस तरफ़ हैं, मुफ़्त कॉल ठीक इसी के लिए है.

हर बिल्ड में

हम जो हर AI feature बनाते हैं उसे default में क्या मिलता है।

Upsells नहीं। हम AI को ईमानदार कैसे रखते हैं।

आपके असली data में grounded
पहले दिन से cost tracked
3 second से कम में answers
आप देख सकते हैं यह क्या कर रहा है
टूटने पर backup plan
risky चीज़ों के लिए human approval
prompts और data के मालिक आप हैं

दृष्टिकोण

हम वाकई AI features कैसे बनाते हैं।

हर task के लिए चुना गया. वहाँ embedded जहाँ काम पहले से हो रहा है. Promotion पाने से पहले मापा गया.

task के लिए सही tool

सोचने-वाले भारी काम के लिए Claude, तेज़ classification और extraction के लिए OpenAI, high-volume वाले grunt work के लिए छोटे सस्ते models. मिलाकर इस्तेमाल किए जाते हैं, किसी एक vendor से शादी नहीं की जाती — तो जब अगले महीने बेहतर model आता है, आप एक config change से upgrade करते हैं, rewrite से नहीं.

answer देने से पहले आपकी चीज़ें पढ़ता है

कोई भी चीज़ live जाने से पहले, हम उसे आपके असली business के 30–100 असली examples पर run करके score करते हैं। Accuracy, cost-per-answer, कितनी बार refuse करता है। अगर numbers bar पास नहीं करते, तो हम tune करते हैं या call करते हैं — आप ऐसे production code के लिए नहीं चुकाते जो test में fail हो।

शिप करने से पहले tested

कुछ भी live होने से पहले, मैं इसे आपके असली बिज़नेस से 30–100 असली examples के खिलाफ़ run करता हूँ और score करता हूँ. Accuracy, cost-per-answer, यह कितनी बार refuse करता है. अगर numbers बार पास नहीं करते, तो हम tune करते हैं या रोक देते हैं — आप ऐसे production code के लिए भुगतान नहीं करते जो test fail करता है.

जहाँ आप पहले से काम करते हैं वहाँ embedded

AI आपकी website, आपके ऐप, आपके email drafts, आपके Power Automate flow के अंदर दिखता है — एक और tab के रूप में नहीं जिसे याद रखना पड़े. सबसे अच्छा AI feature वही है जिसमें किसी को log in नहीं करना पड़ता.

हम हर दिन AI ship करते हैं — अपना और clients का।

हम Claude के साथ daily production code लिखते हैं, अपने Rec Soccer app के अंदर multi-step AI workflows चलाते हैं, और पिछले role में लगभग $40K/साल की vendor reporting को AI-assisted automation से replace किया। Attuned Ventures वो नहीं बेचता जो हमने पढ़ा — हम वो बेचते हैं जो पहले से हमारी मशीनों पर चल रहा है।

जो models से हम बनाते हैं

वे AI models जो आपके बिल्ड को चला रहे हैं.

हम एक vendor से बंधे नहीं हैं। हम वो model चुनते हैं जो आपके task पर जीते — accuracy, cost और speed — और जब बेहतर आए तो swap कर देते हैं। Usage आपके account पर bill होता है, हमारे पर नहीं, इसलिए आप कभी token markup नहीं चुकाते।

Claude

Anthropic

Reasoning, long-context काम, careful writing, और tool use के लिए मेरा default. ज़्यादातर tasks के लिए Claude Sonnet, मुश्किल वालों के लिए Opus.

GPT

OpenAI

सस्ती, तेज़ classification और structured extraction. जब आपको बड़े पैमाने पर "यह email पढ़ो और invoice number निकालो" चाहिए, तो मेरा go-to.

Gemini

Google

विशाल context windows और मज़बूत multimodal — पूरे PDFs, लंबी meeting transcripts, या पूरी codebases एक shot में पढ़ना. Google Workspace data के साथ अच्छा चलता है.

Llama

ElevenLabs

Voice

सबसे अच्छी voice जो मैंने सुनी — cloned voices, multilingual, conversational agents. जहाँ AI को इंसान जैसा सुनाई देना है, यह वही है.

Sora + Nano Banana

OpenAI · Google

Sora से video, Nano Banana (Google का Gemini 2.5 Flash Image) से images. Marketing assets, demo reels, ad creative, product mockups, और social posts के लिए — जब budget में camera crew या designer नहीं है.

sora.com deepmind.google

और जब वे अपनी कीमत निकालें तो specialty services — speech-to-text के लिए Whisper और Deepgram, regulated या enterprise contracts के लिए Azure OpenAI या Amazon Bedrock, open models के long tail के लिए Hugging Face. जो भी आपके task पर जीते.

पैकेज

शुरू करने के तीन तरीके. वो आकार चुनें जो फिट हो.

हर package आपके data में grounded है, launch से पहले evaluated, और आपके मालिकाना हक का। Ranges typical हैं — discovery call के बाद हम एक-page का fixed quote भेजते हैं।

Spark

सामान्य रेंज $500–$2K

टाइमलाइन: 1–2 हफ़्ते
स्कोप: एकल feature
भाषाएँ: अंग्रेज़ी

असली task पर discovery — क्या शामिल है, क्या अच्छा है, क्या टूटा है
Model + prompt 30+ असली examples के खिलाफ़ चुना और tune किया गया
Structured output (JSON schema) ताकि downstream tools उसका इस्तेमाल कर सकें
आपके मौजूदा inbox, CRM, sheet, या workflow में जुड़ा हुआ

Spark प्रोजेक्ट शुरू करें

Stack

सामान्य रेंज $2K–$8K

टाइमलाइन: 3–5 हफ़्ते
स्कोप: Multi-step workflow
भाषाएँ: अंग्रेज़ी + 1

Spark में सब कुछ
आपके docs, PDFs, sheets, Notion, या Slack export पर RAG pipeline
आपके content के लिए tune किया गया Vector store + chunking strategy
हर answer पर source citations — कोई "trust me" responses नहीं

Stack प्रोजेक्ट शुरू करें

Suite

सामान्य रेंज $8K–$15K+

टाइमलाइन: 6–12 हफ़्ते
स्कोप: Multi-feature, ऐप-embedded
भाषाएँ: 2–12 भाषाएँ

Stack में सब कुछ
Tool use — AI आपके APIs, CRM, calendar, ERP, या custom endpoints को call करता है
Retries, fallbacks, और human approval gates के साथ multi-step agent loops
जहाँ अपनी कीमत निकाले वहाँ Multimodal — voice (STT/TTS), images, PDFs

Suite प्रोजेक्ट शुरू करें

ऐड-ऑन

अतिरिक्त सुविधाएँ, जब वे अपनी कीमत निकालें.

इन्हें किसी भी पैकेज पर लगाएँ, या use-case बढ़ने पर बाद में जोड़ें.

RAG corpus / vector store

$800–$2.5K

आपके docs को ingest, chunk, embed, और index करें — और content बदलने पर एक re-ingest job. Firestore vector, Pinecone, या pgvector.

Voice (STT + TTS)

$1K–$3K

Speech-to-text के लिए Whisper या Deepgram, replies के लिए ElevenLabs या OpenAI voices. Phone, browser, या in-app.

Vision / image समझ

$800–$2K

Receipts, IDs, forms, screenshots, product photos पढ़ें. Structured data निकालें या तस्वीर में क्या है उसके बारे में सवालों के जवाब दें.

Eval harness

$600–$1.5K

एक test set, scoring rubric, और एक one-command runner ताकि आप शिप करने से पहले हर prompt या model change का प्रभाव देख सकें.

Prompt-versioning UI

$1K–$2.5K

एक छोटा admin panel जहाँ आप prompts edit कर सकते हैं, A/B variants test कर सकते हैं, और roll back कर सकते हैं — ऐप को redeploy किए बिना.

मासिक AI retainer

$300–$1K / माह

Prompt tuning, cost monitoring, नए models आने पर upgrades, और "इसने ऐसा क्यों किया" वाली investigations — on call.

प्रक्रिया

एक AI प्रोजेक्ट असल में कैसे चलता है.

कोई जादू-छड़ी demos नहीं. एक कॉल, एक scoped pilot, असली measurement, फिर production — या एक ईमानदार रुकावट.

20-मिनट डिस्कवरी कॉल

हम पूछते हैं कि आप कौन सा task fix करना चाहते हैं, आज कौन करता है, "अच्छा" कैसा दिखता है, और data कहाँ रहता है। अगर AI सही tool नहीं है, तो हम बता देंगे — कभी-कभी जवाब Power Automate flow या SQL view होता है।

Scoped pilot

48 घंटों के भीतर आपको एक तय कोट मिलता है — एक feature, एक model, एक success metric. हम आपके data से 30–100 असली examples के खिलाफ़ बनाते हैं, बनाए हुए नहीं.

Evals के साथ measure करें

कुछ भी live होने से पहले हम उसे score करते हैं. Accuracy, cost-per-call, p95 latency, refusal rate. अगर numbers बार पास नहीं करते तो हम tune करते हैं, models बदलते हैं, या रोक देते हैं — आप ऐसे production code के लिए भुगतान नहीं करते जो test fail करता है.

Ship और iterate

आपके app में रहता है, real time में monitor होता है, ऐसी logs के साथ जिन्हें आप देख सकते हैं। ज़्यादातर clients हमें छोटे retainer पर रखते हैं prompts tune करने और model upgrades पर सवारी करने के लिए; कुछ नहीं। दोनों ठीक हैं।

FAQ

जो सवाल लोग आमतौर पर पूछते हैं.

सभी विस्तार करें सभी समेटें

अगर AI कुछ ग़लत कर दे तो?

उन demos से कम जो आपने देखी हैं, लेकिन कभी zero नहीं — इसीलिए हर build आपके data में retrieval के साथ grounded है, structured outputs से validated है, और ship होने से पहले असली examples पर मापा जाता है। High-stakes tasks के लिए हम human approval step add करते हैं। ईमानदार जवाब है: AI गलतियाँ करता है, और engineering यह है कि उसे user से पहले पकड़ लें।

AI usage का payment कौन करता है — मैं या आप?

आपकी, सीधे आपके OpenAI / Anthropic account पर bill होती हैं। हम tokens दोबारा नहीं बेचते, न ही usage पर markup लेते हैं। आप देखते हैं कि हर call कितनी की है, और अगर कभी हम अलग हों, तो keys (और spend) आपके पास रहते हैं।

इसे प्रति माह चलाने में असल में क्या लागत आती है?

Spark feature के लिए, अक्सर $5–$50/महीना API calls में। RAG वाले Stack assistant के लिए, traffic के हिसाब से typically $30–$300/महीना। Prompt caching on हो तो, repeat queries 50–80% सस्ती हो सकती हैं। हम पहले दिन से dashboard दिखाते हैं ताकि कोई surprise न हो।

मेरे data की privacy का क्या?

OpenAI और Anthropic दोनों contractually default पर API traffic पर train नहीं करते। आपका data आपके accounts में रहता है (Firestore, Pinecone, S3 — जो आप use करें), AI calls आपके infrastructure से server-side होती हैं, और कुछ भी third-party SaaS पर log नहीं होता जब तक आप मांगें नहीं। Regulated काम के लिए हम Azure OpenAI या Bedrock भी use कर सकते हैं अतिरिक्त contractual coverage के लिए।

Claude, GPT, Gemini, Llama — आप असल में कौन सा इस्तेमाल करते हैं?

सारे major ones — answer सच में "specific job के लिए जो भी best हो" है। चार बड़े AI providers में से हर एक की अपनी strengths हैं: Claude (Anthropic से) careful writing, reasoning, और business tools इस्तेमाल करने के लिए हमारा default है। GPT (OpenAI से) high volume पर fast, सस्ती classification work के लिए बढ़िया है। Gemini (Google से) तब जीतता है जब process करने को बहुत बड़ी मात्रा में context हो — पूरे PDFs, लंबे meeting transcripts — या जब आपका business पहले से Google Workspace में रहता हो। Llama (Meta से) तब की सही pick है जब आपको AI को अपने servers पर चलाना हो, खासकर regulated industries में जहाँ data premises से बाहर नहीं जा सकता। आपके लिए अच्छी ख़बर: हम सब कुछ ऐसे लिखते हैं कि AI provider को एक single configuration change से swap किया जा सके। जब अगला बेहतर model आता है, आप बिना rewrite के upgrade करते हैं।

Technical लोगों के लिए: voice के लिए हम ElevenLabs इस्तेमाल करते हैं; image generation के लिए, Google का Gemini 2.5 Flash Image ("Nano Banana") और video के लिए OpenAI का Sora। सब एक ही abstraction layer से route होते हैं।

बात करते हैं

बताइए कौन सा task आप करते-करते थक चुके हैं। हम बताएंगे कि AI सही fix है या नहीं।

पहली calls लगभग 20 मिनट चलती हैं — बस उस task के बारे में एक real बातचीत जो आपका समय खा रहा है। आप एक clear next step के साथ निकलेंगे: एक scoped pilot, एक recommendation, या कभी-कभी एक honest "ये spreadsheet problem है, AI problem नहीं।" जो भी fit हो, वही आप सुनेंगे।

— Quinton

मुफ़्त डिस्कवरी कॉल बुक करें hello@attunedventures.com