ComplyAI Master Data Project
Your single source of truth for all data across the ComplyAI organization
Purpose
The Master Data Project documents the data assets across ComplyAI so that anyone can see what data exists and how it is used. It is written for both technical and non-technical stakeholders.
Documentation Index
| Section | Description | Audience |
|---|---|---|
| Data Dictionary | Complete definitions of all data entities, fields, and relationships | All Teams |
| Data Lineage | Visual flows showing how data moves through our systems | Engineering, Analytics |
| API Documentation | OpenAPI/Swagger specs for all internal and external APIs | Engineering, Partners |
| Entity Relationships | ERD diagrams showing data model connections | Engineering, Product |
| Service Architecture | Microservice documentation and inter-service communication | Engineering |
| Third-Party Integrations | Meta, Stripe, and other platform integrations | All Teams |
| Glossary | Business and technical terminology definitions | All Teams |
| Data Governance | Policies, ownership, and compliance frameworks | Leadership, Compliance |
| Quick Reference | Cheat sheets and common lookup tables | All Teams |
System Overview
ComplyAI Platform Services
┌─────────────────────────────────────────────────────────────────────┐
│                          COMPLYAI PLATFORM                          │
├─────────────────────────────────────────────────────────────────────┤
│                                                                     │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ Frontend     │ │ Core API     │ │ Main API     │ │ Maestro      │ │
│ │ (React)      │ │ (Flask)      │ │ (Flask)      │ │ (Flask)      │ │
│ │              │ │              │ │              │ │              │ │
│ │ User Portal  │ │ Auth/Users   │ │ Ad Processing│ │ ML Inference │ │
│ │ Dashboard    │ │ Organizations│ │ Webhooks     │ │ Media Process│ │
│ │ Ad Mgmt      │ │ Business Mgmt│ │ Ad Accounts  │ │ Results      │ │
│ └──────────────┘ └──────────────┘ └──────────────┘ └──────────────┘ │
│                                                                     │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ Violin       │ │ Triangle     │ │ Gong         │ │ IPU          │ │
│ │ (Flask)      │ │ (Flask)      │ │ (Flask)      │ │ (Flask)      │ │
│ │              │ │              │ │              │ │              │ │
│ │ AI Services  │ │ Notifications│ │ Integrations │ │ Image Process│ │
│ │ Scoring      │ │ Ad Rules     │ │ External API │ │ Unit Testing │ │
│ └──────────────┘ └──────────────┘ └──────────────┘ └──────────────┘ │
│                                                                     │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ Async API    │ │ CMS          │ │ WWW          │ │ Adhoc        │ │
│ │ (FastAPI)    │ │ (Django)     │ │ (React)      │ │ (Scripts)    │ │
│ │              │ │              │ │              │ │              │ │
│ │ Async Tasks  │ │ Content Mgmt │ │ Public Site  │ │ Utilities    │ │
│ │ Background   │ │ Wagtail      │ │ Marketing    │ │ Reports      │ │
│ └──────────────┘ └──────────────┘ └──────────────┘ └──────────────┘ │
│                                                                     │
└──────────────────────────────────┬──────────────────────────────────┘
                                   │
                   ┌───────────────┴───────────────┐
                   ▼                               ▼
           ┌──────────────┐                ┌──────────────┐
           │ PostgreSQL   │                │ External     │
           │ Database     │                │ APIs         │
           │              │                │              │
           │ Users        │                │ Meta Graph   │
           │ Organizations│                │ Stripe       │
           │ Ad Accounts  │                │ Slack        │
           │ Ads/Reviews  │                │ AWS/GCP      │
           └──────────────┘                └──────────────┘
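The service boxes in the diagram above can also be captured as a small lookup structure, which is handy for checking counts or grouping services by framework. The following is a minimal sketch, not the project's actual service registry: the `Service` dataclass, the `SERVICES` list, and the `group_by_framework` helper are hypothetical names introduced for illustration. Only the service names, frameworks, and responsibilities are taken from the diagram.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Service:
    """One box from the platform diagram (hypothetical structure)."""
    name: str
    framework: str
    responsibilities: Tuple[str, ...]


# Names, frameworks, and responsibilities are copied from the diagram.
SERVICES: List[Service] = [
    Service("Frontend", "React", ("User Portal", "Dashboard", "Ad Mgmt")),
    Service("Core API", "Flask", ("Auth/Users", "Organizations", "Business Mgmt")),
    Service("Main API", "Flask", ("Ad Processing", "Webhooks", "Ad Accounts")),
    Service("Maestro", "Flask", ("ML Inference", "Media Process", "Results")),
    Service("Violin", "Flask", ("AI Services", "Scoring")),
    Service("Triangle", "Flask", ("Notifications", "Ad Rules")),
    Service("Gong", "Flask", ("Integrations", "External API")),
    Service("IPU", "Flask", ("Image Process", "Unit Testing")),
    Service("Async API", "FastAPI", ("Async Tasks", "Background")),
    Service("CMS", "Django", ("Content Mgmt", "Wagtail")),
    Service("WWW", "React", ("Public Site", "Marketing")),
    Service("Adhoc", "Scripts", ("Utilities", "Reports")),
]


def group_by_framework(services: List[Service]) -> Dict[str, List[str]]:
    """Map each framework to the sorted names of the services built on it."""
    grouped = defaultdict(list)
    for service in services:
        grouped[service.framework].append(service.name)
    return {framework: sorted(names) for framework, names in grouped.items()}


if __name__ == "__main__":
    for framework, names in sorted(group_by_framework(SERVICES).items()):
        print(f"{framework}: {', '.join(names)}")
```

Keeping the diagram's content in one list like this makes it easy to confirm that the number of entries matches the microservice count reported elsewhere on this page.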
Data Domains
1. Customer Domain
- Organizations - Client companies using ComplyAI
- Users - Individual users within organizations
- Subscriptions - Billing and plan information
- Roles/Permissions - Access control
2. Ad Content Domain
- Ad Accounts - Facebook/Meta ad accounts
- Business Managers - Meta business manager connections
- Ads - Individual ad content and metadata
- Media Assets - Images, videos, creative content
3. Compliance Domain
- Reviews - AI compliance review results
- Violations - Policy violation detections
- Policies - Platform policy rules
- Recommendations - AI-generated fix suggestions
4. Operational Domain
- Activity Events - User action audit logs
- Notifications - System alerts and messages
- Webhook Subscriptions - Event subscription tracking
- Issues - Support tickets and feedback
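The four domains above amount to a simple mapping from domain to entities, which can be useful when deciding where a new table or field should be documented. The sketch below is illustrative only: `DATA_DOMAINS` and `domain_of` are hypothetical names, and the mapping mirrors the lists above rather than any real schema or configuration file.

```python
from typing import Dict, List, Optional

# Domain and entity names are taken from the lists above; the structure
# itself is a hypothetical illustration, not an actual ComplyAI artifact.
DATA_DOMAINS: Dict[str, List[str]] = {
    "Customer": ["Organizations", "Users", "Subscriptions", "Roles/Permissions"],
    "Ad Content": ["Ad Accounts", "Business Managers", "Ads", "Media Assets"],
    "Compliance": ["Reviews", "Violations", "Policies", "Recommendations"],
    "Operational": [
        "Activity Events",
        "Notifications",
        "Webhook Subscriptions",
        "Issues",
    ],
}


def domain_of(entity: str) -> Optional[str]:
    """Return the domain that lists `entity`, or None if it is not listed.

    The comparison ignores case and surrounding whitespace.
    """
    wanted = entity.strip().lower()
    for domain, entities in DATA_DOMAINS.items():
        if any(wanted == name.lower() for name in entities):
            return domain
    return None


if __name__ == "__main__":
    for name in ("Violations", "ad accounts", "Invoices"):
        print(f"{name!r} -> {domain_of(name)}")
```

An entity that is not listed in any domain returns `None`, which is a reasonable signal that the lists above need updating before the entity is documented.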
Key Concepts
For Non-Technical Stakeholders
| Concept | What It Means |
|---|---|
| Ad Account | A Facebook advertising account where ads are created and run |
| Business Manager | A Meta tool that organizes ad accounts, pages, and team members |
| Organization | A ComplyAI customer (company) with one or more users |
| Review | The AI analysis of an ad for policy compliance |
| Webhook | A real-time notification sent when something changes (such as an ad's status) |
For Technical Stakeholders
| Concept | Technical Definition |
|---|---|
| System User Token | Long-lived OAuth token for server-to-server Meta API calls |
| OBO (On-Behalf-Of) | Token that allows acting on behalf of a business manager |
| Celery Task | Background job for async processing (ad fetching, scoring) |
| RSID | Revision save ID that Word stores in a document to identify editing sessions; relevant when working with tracked changes |
Quick Stats
| Metric | Count |
|---|---|
| Microservices | 12 |
| Database Tables | 40+ |
| API Endpoints | 150+ |
| External Integrations | 5 |
Getting Started
For Product/Business Teams
- Start with the Glossary to understand terminology
- Review Data Dictionary for entity definitions
- Check Quick Reference for common lookups
For Engineering Teams
- Review Service Architecture for system design
- Explore API Documentation for endpoint specs
- Study Data Lineage to understand how data flows between systems
- Reference Entity Relationships for data modeling
For Analytics/BI Teams
- Start with Data Dictionary for field definitions
- Review Data Lineage to understand the data pipelines
- Check Data Governance for data quality standards
Contributing
To update this documentation:
- Create a branch from `main`
- Make your changes in the appropriate section
- Update the changelog in that section's README
- Submit a PR for review
Last Updated
December 2024 | Version 1.0.0
Maintainers
- Data Team - Overall documentation ownership
- Engineering - Technical accuracy review
- Product - Business context validation