Director, Information Architect
Los Angeles, CA
Technology and Operations - Information Technology / Full Time / On-site
Apply for this job
ABOUT CIM GROUP
CIM is a community-focused real estate and infrastructure owner, operator, lender, and developer. Our team of experts works together to identify and create value in real assets, benefiting the communities in which we invest. Back in 1994, our three founders focused on projects in Southern California neighborhoods. Today, we are a diverse team of 900+ employees with projects across the Americas. Our projects have delivered jobs; created comfortable places to live, work, and relax; and provided necessary and sustainable infrastructure. Our focus on enhancing communities is unwavering, and we strive to make an even greater impact in the years to come. Join us and make an impact today!
POSITION PURPOSE
The Senior Information Architect will be responsible for designing, developing, and maintaining scalable, efficient, and well-documented data models that support analytical, operational, and reporting needs across CIMs $35B+ portfolio spanning real estate equity, infrastructure, and private credit investments.
This role will establish data modeling standards and techniques from the ground up, developing and managing detailed data models deployed across Databricks Lakehouse as the primary platform, with integration to Snowflake and MongoDB for specific use cases. The Senior Information Architect will play a foundational role in building enterprise data architecture capabilities where mature standards do not yet exist.
RESPONSIBILITIES
Business Partnership & Requirements Gathering • Partners extensively with business stakeholders across Fund Accounting, FP&A, Global Client Group, and Investments teams to understand data requirements, pain points, and use cases. • Translate complex business requirements into robust data models, asking questions first before proposing solutions to ensure alignment with actual business needs. • Collaborate with data analysts, data scientists, MLOps engineers, and application developers to understand technical requirements and ensure models support downstream use cases. • Build trust and influence across teams by demonstrating business value and explaining technical constraints in accessible terms.
Data Model Design & Development • Design and develop conceptual, logical, and physical data models for various data initiatives, including data warehouses, data lakes, operational data stores, and transactional systems. • Specialize in designing highly optimized dimensional models (star schemas, snowflake schemas) for analytical reporting and business intelligence applications. • Apply various data modeling techniques as appropriate: Dimensional Modeling (Kimball), 3NF (Inmon), Data Vault, and NoSQL modeling patterns.
Databricks Lakehouse Architecture • Design and implement medallion architecture (bronze/silver/gold) patterns within the Databricks Lakehouse, establishing standards where none currently exist. • Optimize data models leveraging Delta Lake features including ACID transactions, time travel, schema evolution, Z-ordering, and liquid clustering. • Design partitioning strategies that balance query performance with file management, avoiding over-partitioning while enabling partition pruning. • Implement Unity Catalog namespace hierarchy (catalog, schema, table) for multi-domain, multi-environment data organization and governance. • Collaborate with MLOps engineers on data models that support ML feature stores and GenAI/RAG applications.
Multi-Platform Data Architecture • Design and maintain relational database schemas for both operational and analytical workloads within Snowflake and other RDBMS, ensuring integration with Databricks Lakehouse. • Design and optimize data structures for NoSQL databases (e.g., MongoDB), considering document structures, indexing, and query patterns for specific application needs. • Develop frameworks for deciding when data belongs in Databricks Lakehouse vs. Snowflake vs. MongoDB based on workload characteristics and use cases.
Performance Optimization • Provide input and recommendations on query optimization, indexing strategies, and data partitioning based on data model design. • Diagnose and resolve performance issues including data skew, small files problems, and inefficient join strategies in Spark/Databricks environments. • Collaborate with database administrators and data engineers on OPTIMIZE, VACUUM, and ANALYZE strategies for Delta tables.
ETL/ELT Collaboration & Data Pipeline Design • Work closely with Data Engineers to ensure data models are efficiently implemented and align with ETL/ELT processes using Auto Loader, Delta Live Tables, or traditional Spark jobs. • Provide guidance on data mapping, transformation rules, schema evolution handling, and data loading strategies. • Design slowly changing dimension (SCD) patterns using Delta Lake MERGE operations and Change Data Feed for downstream propagation.
Data Governance & Standards • Establish and enforce data modeling standards, naming conventions, metadata management, and data governance policies-building these foundations where they do not currently exist. • Implement row-level and column-level security patterns using Unity Catalog for sensitive fund and investor data. • Design and maintain data lineage tracking from source systems through bronze/silver/gold layers to final reports. • Contribute to the development and maintenance of a comprehensive data dictionary and metadata repository. • Operate within compliance framework to ensure ethical data handling, regulatory compliance, and consistency across the enterprise.
Documentation & Knowledge Management • Create and maintain detailed data model documentation, including data dictionaries, entity-relationship diagrams (ERDs), data flow diagrams, and data lineage. • Document Lakehouse design patterns, medallion architecture implementations, and platform-specific best practices for team knowledge sharing. • Contribute to data quality initiatives by identifying potential data quality issues at the modeling stage and collaborating on solutions.
EDUCATION/EXPERIENCE REQUIREMENTS
DATABRICKS PLATFORM REQUIREMENTS
MULTI-PLATFORM EXPERIENCE
PREFERRED
DESIRED CERTIFICATIONS
ABOUT YOU
The ideal candidate thrives in an entrepreneurial environment where mature processes and standards do not yet exist. You are energized by the opportunity to build data architecture foundations from scratch rather than inheriting established frameworks.
KEY COMPETENCIES
WHAT SUCCESS LOOKS LIKE
WHAT CIM OFFERS
At CIM, we believe our success stems from our collective efforts, and we are committed to providing well-rounded support and resources for our employees. In addition to a competitive compensation plan, CIM offers a comprehensive benefits program for employees to thrive both inside and outside of work. Eligible employees can enjoy a wide range of benefits, including:
CIM Group is a premier full service urban real estate and infrastructure fund manager with approximately $20.5 billion of assets under management. Since its founding in 1994, CIM has been a process- and research-driven investor that mitigates risk through the fundamental analysis of the long-term drivers in communities. CIM is a relative value investor that systematically targets investments that are priced below their long-term intrinsic value. Over time, CIM has delivered a strong risk-adjusted track record of returns by relying on its vertically-integrated team, investment discipline, and sourcing capabilities.