name: portfolio-content-discovery displayName: People Inc Portfolio Content Discovery Workflow description: | Composition that walks the full People Inc brand portfolio to discover and index recent content. For each brand, calls people-inc-rss.fetchBrandFeed to pull the latest articles, then optionally falls back to people-inc-sitemap.fetchSitemapIndex to enumerate URLs the RSS feed does not surface. Designed for governance and inventory use cases — NOT for bulk training-data extraction. version: 0.1.0 owner: api-evangelist tags: - Workflow - Content Discovery - Media - Governance uses: - shared/people-inc-rss.yaml - shared/people-inc-sitemap.yaml inputs: - name: brands type: array items: type: string required: true description: List of People Inc brand domains to walk. outputs: - name: discoveredArticles type: array steps: - id: walkBrands forEach: brands do: - capability: people-inc-rss operation: fetchBrandFeed with: brandDomain: "{item}" - capability: people-inc-sitemap operation: fetchSitemapIndex with: brandDomain: "{item}" onError: continue policy: respectsRobotsTxt: true aiBotPolicyAware: true rateLimitNotes: | No published rate limit. Empirically: requests at human-pace (1 req per second per brand) are reliable; bursts trigger the WAF and surface 403s. Use back-off + UA rotation aligned with declared agent policy.