Data Warehouse Advisory Group: 2-19-26

Data Warehouse Advisory Group: 2-19-26


Meeting Details

Meeting Date:

Feb 19, 2026

Purpose:

Data Warehouse Advisory Group

Participants:

Mark Cohen, Steve Klein, Erik Cooper, Layheng Ting , Pam Mery, Christopher Blackmore, Denice Inciong, Jason Makabali, Jeanae Releford, Kai Yun Pekarsky, Tim Flanagan, Vinod Verma, Virginia/Ginny Moran, Matt Hurley, Jacob Kevari, Eric Houck, Amber Hroch, Jack Thompson, Elaine Kuo, Gayle Pitman, Gene Tjoa

Agenda

Item

Item

1

Review and feedback on Sankey diagrams and other visualizations and their use cases

Sankey Diagram Review & Feedback

Overview of Sankey Visualization

Jack presented a Sankey diagram built in Power BI to visualize individual student course-taking pathways across semesters and colleges.

Key Features

Displays:

  • Semester → College → Course flow

  • Student-level chronological course enrollment

Shows:

  • Courses taken at the user’s college

  • Courses taken at “Other College” (masked if outside district access level)

Supports:

  • Filtering by student success momentum

  • Cohort aggregation (multiple students selectable)

  • Direct Query to Redshift for large fact tables (billions of rows)

Loads quickly via structured fact table (optimized distribution & sort keys)

Student Success Momentum Metric

  • Value between 0 and 1

  • Based on:

    • Recency of course-taking

    • Volume of courses

  • Higher score = stronger momentum

  • Used as filter to identify students potentially needing support

Example use case:

  • Filter: Momentum < 0.85 and > 0.32

  • Identify students who may have slowed enrollment activity.

Discussion emphasized more positive terminology than “intervention.”

Suggested terminology:

  • “Engagement”

  • Integration with Guided Pathways / intrusive advising frameworks

Key Use Cases Discussed

  1. Student-Level Advising

  • Counselor reviewing individual student progression

  • Identify swirl (multi-college attendance)

  • Spot off-ramping in structured programs (e.g., nursing)

  1. Cohort-Level Analysis

  • Analyze structured cohorts

  • Evaluate pathway adherence

  1. Auto-Award / Degree Audit Enhancement

  • Integration with external college coursework

  • Request to expose CB00 course identifiers

  • Potential to:

    • Improve degree audit accuracy

    • Increase proactive transcript requests

    • Support auto-application for degree completion

Reported outcome:

  • Colleges have doubled/tripled degree completion rates using proactive outreach.

  1. Swirl Tracking

  • District-level users can see named colleges within district

  • Chancellor’s Office sees all colleges

  • College-level users see “Other College” for outside institutions

Requested Enhancements

Enhancement

Status

Add modality dimension

Planned in sprint

Add success rates (individual & aggregate)

Under consideration

Include gateway course flags

Possible

Expose CB00 identifiers

Can be provided

Add Clearinghouse integration

Potential (college-uploaded licensed data)

Visualization Feedback

Concern:

Sankey diagrams can be difficult to explain to non-technical audiences.

Suggested Alternatives:

  • Aggregated bar charts

  • Cohort-level summaries

  • Traditional dashboard views layered over fact table

  • Structured program sequence views

Conclusion:
Sankey remains useful for individual-level drilldown; alternative visuals recommended for executive or aggregated reporting.

2

Exploration of AI Applications to Enhance Data Warehouse Development and Utilization (continued)

AI Applications in the Data Warehouse (Continued Discussion)

AI Opportunity Areas Identified

  1. AI-Assisted Data Navigation

Jack demonstrated an AI-powered MIS documentation assistant using:

  • Large Language Model (LLM)

  • Retrieval-Augmented Generation (RAG)

  • Vector store of MIS documentation

Capabilities:

  • Answer natural language questions

  • Generate sample SQL queries

  • Explain data quality and collection changes

  • Clarify table locations

Example Use Cases:

  • “Where is financial aid data stored?”

  • “Where can I find demographic data for research?”

Guardrails:

  • Limited to public documentation

  • No direct student data access

  • Queries provided for researcher validation

  1. AI for Data Governance

Raised by Craig:

Potential applications:

  • Automating repetitive governance tasks

  • Metadata management

  • Documentation assistance

  • Data lineage validation

  1. AI for MIS Error Detection & Anomaly Detection

Discussed by Christopher & Loris:

Use cases:

  • Detect cohort inconsistencies

  • Flag anomalous student status changes

  • Identify single errors generating thousands of downstream errors

  • Pre-submission MIS validation

Opportunity:

  • AI-driven anomaly detection pre-submission

  • Auto-feedback loops to departments

  • Reduced manual MIS workload

Open question:

  • Should validation occur pre- or post-submission?

  • Integration with Common Cloud MIS tools?

  1. AI in Student Support

Aligned with Vision 2030 priorities:

Potential:

  • Force multiplier for student support staff

  • Predictive engagement signals

  • Enrollment fraud detection

  • Natural language data interface

Craig noted:

  • Risk tolerance varies across system

  • AI governance guidance memo forthcoming

  • AI microsite launching to centralize initiatives

  1. AI Agents & Security Considerations

Discussion points:

  • Not ready for AI agents directly executing warehouse queries

  • Risk tolerance currently moderate-to-low

  • Importance of guardrails

  • Potential use of graph databases for join validation

Concept introduced:
“Vibe coding” — AI generating SQL dynamically (future possibility).

Issues/Questions Resolved

Issue/Question

Resolution/Answer

Issue/Question

Resolution/Answer

1

 

 

Issues/Questions Needing Resolution

Issue/Question

Resolution/Answer

Issue/Question

Resolution/Answer

1

 

 

 

 

Action Items/Next Steps

Item

Notes

Item

Notes