# Robots.txt for DiscoverFashions # https://www.discoverfashions.com/robots.txt # Last Updated: 2025-12-04 # =========================================== # RELATED FILES FOR BOTS AND AI SYSTEMS # =========================================== # AI Policy: https://www.discoverfashions.com/ai.txt # AI Feed (JSON): https://www.discoverfashions.com/ai-feed.json # AI Sitemap (XML): https://www.discoverfashions.com/ai.xml # LLMs.txt: https://www.discoverfashions.com/llms.txt # Security: https://www.discoverfashions.com/.well-known/security.txt # Sitemap: https://www.discoverfashions.com/sitemap.xml # =========================================== # DEFAULT RULES FOR ALL CRAWLERS # =========================================== User-agent: * Allow: / Disallow: /api/ Disallow: /_next/ Disallow: /admin/ Disallow: /private/ Crawl-delay: 1 # =========================================== # SEARCH ENGINE BOTS # =========================================== # Google User-agent: Googlebot Allow: / Disallow: /api/ Disallow: /_next/ Crawl-delay: 1 User-agent: Googlebot-Image Allow: /images/ Allow: / Disallow: /api/ User-agent: Googlebot-News Allow: / # Bing User-agent: Bingbot Allow: / Disallow: /api/ Disallow: /_next/ Crawl-delay: 2 User-agent: msnbot Allow: / Disallow: /api/ Crawl-delay: 2 # Yahoo User-agent: Slurp Allow: / Disallow: /api/ Crawl-delay: 2 # DuckDuckGo User-agent: DuckDuckBot Allow: / Disallow: /api/ Crawl-delay: 2 # Yandex User-agent: Yandex Allow: / Disallow: /api/ Crawl-delay: 3 # Baidu User-agent: Baiduspider Allow: / Disallow: /api/ Crawl-delay: 3 # =========================================== # AI AND LLM CRAWLERS # For detailed AI policy, see: /ai.txt # For LLM-optimized content, see: /llms.txt # =========================================== # OpenAI GPT User-agent: GPTBot Allow: / Disallow: /api/ Disallow: /_next/ Disallow: /admin/ Crawl-delay: 10 User-agent: ChatGPT-User Allow: / Disallow: /api/ Crawl-delay: 10 # Anthropic Claude User-agent: ClaudeBot Allow: / Disallow: /api/ Disallow: /_next/ Crawl-delay: 10 User-agent: Claude-Web Allow: / Disallow: /api/ Crawl-delay: 10 User-agent: anthropic-ai Allow: / Disallow: /api/ Crawl-delay: 10 # Google AI User-agent: Google-Extended Allow: / Disallow: /api/ Disallow: /_next/ Crawl-delay: 10 # Apple User-agent: Applebot Allow: / Disallow: /api/ Crawl-delay: 5 User-agent: Applebot-Extended Allow: / Disallow: /api/ Crawl-delay: 10 # Perplexity AI User-agent: PerplexityBot Allow: / Disallow: /api/ Disallow: /_next/ Crawl-delay: 10 # ByteDance User-agent: Bytespider Allow: / Disallow: /api/ Crawl-delay: 10 # Common Crawl User-agent: CCBot Allow: / Disallow: /api/ Crawl-delay: 10 # Cohere User-agent: cohere-ai Allow: / Disallow: /api/ Crawl-delay: 10 # Diffbot User-agent: Diffbot Allow: / Disallow: /api/ Crawl-delay: 10 # Meta/Facebook User-agent: FacebookBot Allow: / Disallow: /api/ Crawl-delay: 5 User-agent: Meta-ExternalAgent Allow: / Disallow: /api/ Crawl-delay: 10 User-agent: Meta-ExternalFetcher Allow: / Disallow: /api/ Crawl-delay: 10 # You.com User-agent: YouBot Allow: / Disallow: /api/ Crawl-delay: 10 # Omgili User-agent: omgili Allow: / Disallow: /api/ Crawl-delay: 10 # Amazon/Alexa User-agent: Amazonbot Allow: / Disallow: /api/ Crawl-delay: 10 # AI Search Engines User-agent: AI2Bot Allow: / Disallow: /api/ Crawl-delay: 10 User-agent: Ai2Bot-Dolma Allow: / Disallow: /api/ Crawl-delay: 10 # =========================================== # SOCIAL MEDIA BOTS # =========================================== User-agent: Twitterbot Allow: / User-agent: LinkedInBot Allow: / User-agent: Pinterest Allow: / Allow: /images/ User-agent: facebookexternalhit Allow: / # =========================================== # SEO AND ANALYSIS TOOLS # =========================================== User-agent: AhrefsBot Allow: / Disallow: /api/ Crawl-delay: 5 User-agent: SemrushBot Allow: / Disallow: /api/ Crawl-delay: 5 User-agent: MJ12bot Allow: / Disallow: /api/ Crawl-delay: 5 User-agent: DotBot Allow: / Disallow: /api/ Crawl-delay: 5 # =========================================== # BLOCKED BOTS (Aggressive/Malicious) # =========================================== User-agent: AhrefsBot/7.0 Disallow: / User-agent: MJ12bot/v1.4 Disallow: / User-agent: SemrushBot-SA Disallow: / User-agent: BLEXBot Disallow: / User-agent: dotbot Disallow: / User-agent: Sogou Disallow: / User-agent: MegaIndex Disallow: / User-agent: ltx71 Disallow: / User-agent: Screaming Frog Disallow: / # =========================================== # SITEMAPS AND RESOURCES # =========================================== Sitemap: https://www.discoverfashions.com/sitemap.xml Sitemap: https://www.discoverfashions.com/ai.xml # =========================================== # ADDITIONAL INFORMATION # =========================================== # Website: https://www.discoverfashions.com # Contact: info@discoverfashions.com # AI Policy: https://www.discoverfashions.com/ai.txt # AI Feed (JSON): https://www.discoverfashions.com/ai-feed.json # AI Sitemap (XML): https://www.discoverfashions.com/ai.xml # LLMs.txt: https://www.discoverfashions.com/llms.txt # Security: https://www.discoverfashions.com/.well-known/security.txt # End of robots.txt