Best math programs for elementary students: an evidence-linked decision guide

This guide helps instructional leaders select best math programs for elementary students by aligning evidence-based criteria with local goals, resources, and implementation.

May 21, 2026

Editorial illustration for Best math programs for elementary students: an evidence-linked decision guide.

Overview

Choosing the best math programs for elementary students is a fit judgment, not a single correct answer. The right program depends on student demographics, instructional capacity, technology constraints, and standards context.

Independent quality signals help identify materials that align to standards and embody sound design. EdReports evaluates curricula across Focus and Coherence, Rigor and Mathematical Practices, and Usability. The What Works Clearinghouse reviews programs for evidence of student-learning impact. Implementation capacity, however, is often the largest determinant of whether a program produces results in practice. This guide focuses on grades K–5 and is written for instructional leaders, curriculum directors, teacher-leads, and principals preparing for an adoption cycle or troubleshooting a current program.

The guide draws on independent quality signals, market reporting from Education Week, and practitioner commentary. It favors language that reflects evidence limits rather than overstating claims. Start by calibrating what "best" must accomplish in your context. Use the program snapshots to build a shortlist, stress-test that shortlist with the decision matrix, and then move from shortlist to pilot using the implementation timeline, cost, and procurement guidance below.

Who this guide is for and how to use it

This guide is primarily for school and district instructional leaders evaluating K–5 math programs for adoption or re-adoption. A secondary audience includes homeschool coordinators and microschool leaders seeking school-grade rigor without full district infrastructure.

Use the "How 'best' varies" section to define your selection criteria, then review program snapshots to build a shortlist. Use the decision matrix to filter that shortlist against your constraints — technology, ELL needs, PD capacity. Follow the implementation timeline and cost sections to move from pilot to scaled rollout.

If technology, access, or compliance is your chief concern, skip ahead to those sections for operational checks.

How 'best' varies by goal: outcomes, access, and constraints

Treating "best" as a fixed quality label is the most common selection error. Instead, match program design to your goals and constraints.

A curriculum can score highly on EdReports — showing alignment to standards and coherent rigor — yet fail in a context without sufficient PD, time, or technology infrastructure. Conversely, a program with dense procedural practice might show quick assessment gains but leave conceptual foundations weak for later topics such as fractions and early algebra. Research summarized by Accelerate Learning points to conceptual understanding, classroom discourse, and formative assessment as defining features of high-quality elementary math programs — criteria that matter alongside raw alignment scores.

Name your goals up front: improving state-assessment proficiency, building conceptual depth and primary-grade number sense, reducing math anxiety while increasing engagement, supporting English Language Learners, or closing large learning gaps. Each goal emphasizes different features. Discourse- and problem-based curricula support long-term conceptual growth but require teacher facilitation skill; they may not show large year-one test gains. Practice-heavy programs can boost short-term fluency but risk gaps in mathematical reasoning over time.

Worked example — calibrating "best" to your context:

A Title I K–5 school with 68% English Language Learners, one shared device cart per grade, and a part-time instructional coach (approximately four hours per week) faces a specific set of constraints. Limited devices and intermittent coaching make daily device-dependent programs and full-school launches impractical. Bilingual or substantial ELD supports become high priorities, not optional extras. In this setting, prioritize print-complete materials with documented bilingual supports, strong teacher-facing usability, and publisher resources that do not assume daily 1:1 device access. EdReports Usability indicators — particularly those covering supports for teachers and English Language Learners — matter more than raw alignment scores. The practical decision: shortlist programs that meet EdReports "Meets Expectations" across all gateways, confirm print and bilingual availability directly with the publisher, and pilot a single grade with the coach before scaling. This sequencing limits risk and generates local evidence that internal stakeholders find more persuasive than publisher efficacy studies conducted elsewhere.

Core curricula vs supplemental programs

Decide first whether you need a core curriculum or a supplemental tool. A core provides a complete, coherent year-long instructional sequence — lesson plans, formative assessments, student materials. A supplemental program is designed to extend, reinforce, or diagnose, not to replace primary instruction.

Adoption implications differ substantially. Cores demand comprehensive PD on lesson facilitation, pacing, and assessment use. Supplements require an integration protocol that specifies when, for how long, and how data feed back into core instruction. Problems occur when districts treat supplements as cores. Adaptive practice platforms and game-based tools are typically built for targeted practice on discrete skills; they rarely develop sustained conceptual understanding across K–5.

A simple practical rule helps clarify the distinction: if a student can complete the program's daily activities without encountering novel problems, producing written arguments, or participating in discussion, the program is supplemental, not core. Illustrative Mathematics (IM) K–5, Bridges in Mathematics, and Eureka Math/EngageNY are examples of core curricula. Zearn functions primarily as a digital companion to IM and sits between core and supplemental roles depending on how a school deploys it.

Program snapshots: IM, Bridges, Eureka/EngageNY, and Zearn—where each tends to fit

These four programs are widely discussed in K–5 adoption contexts. The notes below summarize typical fits and limitations, drawing on independent reviews where available. Education Week's 2023 curriculum database reporting found that the most commonly district-selected elementary math curricula include Envision Mathematics and Into Math — programs that do not always top independent reviewer lists — highlighting that market prevalence and quality signals do not always align.

Illustrative Mathematics (IM) K–5: IM is a problem-based, discussion-focused curriculum organized around units with clear conceptual arcs. It has received EdReports "Meets Expectations" ratings across gateways. It assumes teachers skilled in inquiry facilitation and productive mathematical discourse. With limited PD, teachers often shorten exploratory tasks and default to direct instruction, undermining design intent. IM's print materials are complete and usable without digital access, and Spanish editions and ELD supports are documented in publisher materials. Many districts pair IM with Zearn as a digital companion rather than using Zearn as the sole core.

Bridges in Mathematics: Bridges uses a workshop model with explicit visual and problem-solving components and has strong EdReports ratings. It is often described as more teacher-friendly in lesson structure, offering more scripting and scaffolding for facilitation. Bridges emphasizes manipulatives and visual models, which help students who struggle with abstraction. Print materials are comprehensive; digital tools are supplementary, not required.

Eureka Math / EngageNY: Originally released as EngageNY and later published commercially as Eureka Math, this curriculum is widely adopted and carries strong EdReports alignment ratings. It is rigorous and knowledge-intensive with detailed lesson scripts. That density supports deep learning but can create pacing pressure and heavy teacher preparation demands. EngageNY's free online availability makes it accessible for districts with limited budgets. Districts using the free materials without print kits or PD commonly report implementation challenges and family confusion about new lesson formats.

Zearn K–5: Zearn is a digital-first program closely integrated with IM lesson structures and is commonly used as a supplemental or companion resource. It requires reliable device access and is not viable as a primary program in low-bandwidth or limited-device contexts unless paired with IM's print core materials.

Beast Academy: For students needing accelerated challenge, Beast Academy offers a problem-solving-focused curriculum and illustrated books for grades 1–5, available in online and print formats. It is typically used alongside a school's core curriculum to provide enrichment rather than replace primary instruction.

Decision matrix: match features to your context

This checklist helps you prioritize program features before requesting demos or reading publisher marketing. Use it to filter your shortlist, not to make a final selection.

If your primary context is high ELL or dual-language instruction, prioritize publisher-supplied Spanish editions or full bilingual student materials, built-in English Language Development supports within lesson plans, visual models and manipulative-heavy lessons that reduce language load, and family communication materials in multiple languages.

If your primary context is limited device access or low bandwidth, prioritize print-complete student and teacher materials (core instruction must not depend on digital access), downloadable or offline teacher resources, and optional supplemental digital tools that do not affect lesson fidelity when unavailable.

If your primary context is mixed-grade or multi-age classrooms, prioritize a scope-and-sequence that supports looping or compacting across grades without breaking coherence, teacher guides explaining progression across grades and not only within a grade, and built-in flexible grouping suggestions.

If your primary context is limited PD capacity or high teacher turnover, prioritize strong scripting and facilitation notes in teacher guides, high Usability ratings on EdReports (especially indicators for supporting teacher learning), and publisher-provided on-demand training videos or embedded lesson-level coaching notes.

If your primary context is significant unfinished learning or remediation needs, prioritize diagnostic assessments that identify prerequisite gaps as well as grade-level readiness, intervention modules explicitly linked to core lessons, and MTSS/RTI alignment documentation from the publisher.

If your primary context is strong PD capacity and readiness for problem-based instruction, prioritize full EdReports "Meets Expectations" ratings with strong rigor and mathematical practice integration, problem-based lesson structures that support student discourse, and formative assessment architecture with checks embedded in each lesson.

Implementation timeline: 90-day pilot and 12-month rollout

Implementing a chosen program with fidelity is harder than choosing it. The following framework is a starting point — adjust to your PD calendar, adoption rules, and teacher capacity.

Months 1–3: Foundation and pilot launch. Begin with a materials audit and technology readiness check. Confirm print kits, test digital access on actual devices, and complete SSO/rostering setup before students use the program. For district-scale rostering (Clever, ClassLink, SAML), allow at least two weeks for IT coordination. Provide a minimum of 8–10 hours of initial PD before teachers teach the first lessons, covering design principles, lesson structure, and facilitation of common lesson types. Identify a pilot grade or two and assign an instructional coach or lead teacher as the fidelity support. Use a simple fidelity check to confirm teachers are completing launch, activity, discussion, and synthesis components of each lesson.

Months 4–6: Data review and mid-course adjustment. Collect interim assessment and observation data and conduct structured teacher debriefs. Watch for common failure modes: shortened or skipped problem-based activities, pacing lagging behind the scope-and-sequence, and supplemental tools displacing core time. Use formative-data tools that reveal student thinking — not just correct/incorrect counts — to diagnose whether gaps stem from curriculum design, pacing, or facilitation. Adjust pacing or PD focus based on evidence from assessments and classroom observations.

Months 7–9: Expanding the cohort. If pilot data are positive, expand to additional grades with a second cohort of PD. Plan for roughly 6–8 hours of PD per new grade team before launch. Continue coaching cycles and collect fidelity and student-learning data as new grades onboard. Communicate proactively with families — use publisher family guides and translated materials to explain changes in lesson structure or homework expectations.

Months 10–12: District-level fidelity and planning for year two. Conduct a full implementation review. Identify grades meeting pacing expectations, teachers needing coaching, and whether supplemental tool usage aligns with core goals. Plan year-two PD to deepen teacher practice — for example, facilitating mathematical argumentation and using formative data to adjust same-day instruction — rather than repeating introductory training. Document decisions and rationale for stakeholders, especially when adoption required significant budget or policy approvals.

Cost and total cost of ownership factors

The per-student license is rarely the largest cost driver. Build a multi-year total cost of ownership that includes materials, PD, shipping, and staffing.

When building a budget request, enumerate these cost drivers: consumable student workbooks (annual repurchase), teacher editions and resource sets (one-time or multi-year), physical manipulative kits, initial PD and coaching (often 20–40 hours per teacher in year one for problem-based curricula), ongoing coaching and follow-up PD, print shipping costs for large adoptions, digital platform licensing (annual), and staff time for curriculum coordinators managing the process. Expect a three- to five-year horizon before implementation maturity. Budgeting only year-one materials and a single PD event commonly underestimates true costs.

Procurement levers to explore include state or regional purchasing co-ops, Title I or ESSER funds for eligible schools, and publisher discounts or pilot offers for limited classrooms. For schools with Title I designation or nonprofit status, raise that status during sales conversations — publishers and ed-tech vendors occasionally offer structured discounts for qualifying schools, and those rates are not always listed publicly.

Technology, access, and privacy essentials

Before finalizing any program with a digital component, verify device, rostering, and privacy requirements with IT and the publisher.

Confirm device and bandwidth minimums and test program behavior on your actual devices and networks. "Offline" modes vary from full functionality to limited feature sets — confirm which features are unavailable offline before piloting. Rostering and SSO (Clever, ClassLink, SAML) are frequent implementation delays; confirm support and timelines early, and add a technology-readiness checkpoint to your pilot launch checklist.

Review the publisher's data processing agreement against district requirements for FERPA and COPPA compliance; COPPA considerations apply for students under 13. Ask whether student work is used to train vendor models, how PII is retained after contract end, and whether SOC 2 or similar third-party audits have been completed. Require explicit written answers on model-training use and data retention before signing contracts. This question matters for AI-powered tools: some platforms use student work to improve their models unless districts opt out or contractual protections are in place. Frizzle's pricing page documents that its Institution tier includes a custom DPA, FERPA and COPPA compliance, and SOC 2 Type II — the kind of documentation district procurement teams typically require before approving a tool at scale.

Accessibility and language supports

Evaluate accessibility both technically and pedagogically. For technical checks, request platform WCAG 2.1 conformance documentation, VPATs, and screen-reader testing results. Request any Section 508 documentation required for federally funded procurements.

For pedagogical accessibility, review visual supports, multiple representations, and reduced language load. Check EdReports indicators related to supports for students with disabilities and English Language Learners in the Usability gateway — these provide independent reviewer evidence rather than relying solely on publisher marketing claims.

For multilingual campuses, confirm whether Spanish or other language editions are full translations of core student materials or limited adaptations such as glossaries, sentence frames, or family materials only. Full translations matter for true dual-language implementation. If Spanish is not the primary community language, ask about other supported languages and the depth of those resources before committing to a program.

Pairing core curricula with supplemental tools without overload

Supplemental tools commonly fail when they compete with core instruction time. If a 60-minute math block devotes 20 minutes to an adaptive practice platform, core lessons designed for full periods are truncated. Exploratory and discussion components are often cut first. The result is a shift toward lecture-plus-practice that undermines problem-based curricula.

Assign supplemental tools to distinct, non-competing purposes: diagnostics at unit starts or intervention windows, fluency practice in a separate 5–10 minute block, and enrichment in extension time. Before adopting a supplemental tool, answer two questions: what specific gap in our core does this tool fill, and how will teachers use its data to adjust core instruction? If neither question has a concrete answer, the tool is likely to add reporting burden without instructional benefit.

Tools such as i-Ready, ST Math, and DreamBox serve different supplemental roles — diagnostic, visual or conceptual practice, or adaptive reinforcement — and should be chosen and scheduled accordingly. AI-powered grading tools occupy a different role still: they can surface step-level misconceptions from handwritten work that end-of-unit assessments typically miss, giving teachers actionable data without adding a separate student-facing platform. Frizzle, for example, uses computer vision to parse each step of handwritten student work, flags 147 named misconceptions mapped to standards, and feeds results into a live dashboard showing which errors are spreading across a class — without requiring students to log in or change how they work on paper. That kind of tool integrates into existing core instruction rather than competing with it.

Edge cases and special contexts

Rural mixed-grade classrooms. Single teachers covering K–2 or 3–5 need curricula with explicit cross-grade progression guidance so they can loop or compact lessons coherently. Bridges' visual approach and manipulative emphasis is often cited by small-school practitioners as adaptable for flexible grouping, while highly scripted single-grade programs can be harder to compress across age spans.

Frequent mid-year transfers. Tightly sequenced curricula create challenges for mid-year entrants. Programs with unit-level diagnostic pre-assessments give receiving teachers faster, actionable information about what a new student has covered and where gaps may have formed.

Unreliable internet. Treat digital components as optional enhancements in low-bandwidth contexts. Confirm that print materials are a complete instructional experience — Bridges, Eureka/EngageNY, and IM offer full print editions that can operate without digital dependency.

High-ELL populations. Most mainstream curricula offer ELL supports as add-ons rather than core design features. For majority-ELL schools, select curricula with strong visual models and discourse-rich lessons, pair them with explicit ELD protocols developed by language specialists, and prioritize full-language editions where available.

Students with dyscalculia or math anxiety. No mainstream K–5 curriculum is specifically designed for dyscalculia. Prioritize materials that use multiple representations (concrete, pictorial, abstract), allow nonstandard solution paths, include explicit number-line and area-model work, and surface student thinking in formative tools rather than only correct/incorrect scores. Grading tools that parse stepwise work and recognize partial-credit solution paths can help identify patterns that end-point assessments miss entirely — for instance, a student who consistently applies correct logic but makes the same procedural error at a specific step will look very different in step-level data than in a final-answer score.

Procurement and funding basics

Curriculum adoptions commonly run on 5–7 year cycles. Confirm any district or bargaining constraints before beginning an evaluation to avoid wasted committee work.

A pilot before full adoption is usually worth the investment. Publishers typically offer 30–90 day pilots; use them to gather implementation data — teacher confidence, lesson completion rates, student assessment results — rather than relying solely on publisher efficacy studies. Title I and ESSER funds can support materials, PD, and coaching for eligible schools; purchasing co-ops often secure lower per-student rates than direct quotes. Document the selection process thoroughly — criteria, programs reviewed, pilot data, stakeholder input, and final rationale — for board approval and future reference.

FAQs

What is the practical difference between a core elementary math curriculum and a supplemental program?

A core curriculum delivers a complete, coherent instructional sequence for a grade — lessons, assessments, student materials — and is intended as primary instruction. A supplemental program reinforces, diagnoses, or extends; it is not designed to be the sole source of instruction. Using a supplemental tool as a de facto core typically creates gaps in conceptual reasoning and mathematical language development that surface in later grades.

Which K–5 programs show independent quality signals on EdReports, and how should leaders interpret those ratings?

EdReports evaluates curricula across Focus and Coherence, Rigor and Mathematical Practices, and Usability. Programs that "Meet Expectations" across gateways have been independently reviewed for alignment and instructional quality; examples include Illustrative Mathematics K–5, Bridges in Mathematics, and Eureka Math. EdReports assesses published materials, not implementation fidelity, so leaders should interpret ratings as one input alongside PD capacity and contextual fit — not as a guarantee of outcomes.

Do any leading elementary math programs have ESSA Tier I or II evidence, or What Works Clearinghouse reviews?

The What Works Clearinghouse periodically reviews curricula; check the WWC database for current entries, as coverage varies by program and grade band. ESSA Tier I (randomized trials) and Tier II (strong quasi-experimental studies) evidence is less common for K–5 core curricula, and program efficacy often depends heavily on implementation quality. Treat publisher-funded studies with caution and seek independently conducted research where possible.

Which programs align with TEKS rather than Common Core, and what adjustments are needed?

Texas uses TEKS; some publishers offer TEKS-specific editions or crosswalks. Verify that a TEKS edition or comprehensive crosswalk exists for all grades you plan to adopt. When crosswalks are used, review them carefully to identify supplementation needs at specific grade levels. Some tools and vendors provide multi-state alignment documentation — including CCSS, TEKS, NGSS, and 30+ state frameworks — that can simplify this step.

Are these curricula usable in homeschools or microschools without district infrastructure?

Core curricula assume teacher-facilitated classroom structures and peer discourse; homeschools and microschools can use them but must substitute for facilitation and group interaction. Publishers and alternative providers — for example, Beast Academy for enrichment-focused instruction or homeschool-focused publishers reviewed by communities on YouTube and curriculum comparison sites — offer materials or editions more suited to independent or parent-led instruction.

How much do elementary math curricula typically cost per student?

Per-student license figures vary widely with district size, contract terms, and whether print and digital are included. Request current per-student quotes from publishers and compare against purchasing co-op rates in your state. Budget for consumables, PD, shipping, and staffing — these often exceed the base license in year one. For AI-powered supplemental tools, pricing models vary from free tiers to flat annual rates to enrollment-based contracts; Frizzle's pricing page shows one example of a freemium structure (free forever for individual teachers up to stated limits, $200/year for the Pro plan, and institution-scale pricing on request) that can help budget planners understand the range of models in this category.

---

Making your decision: The clearest path forward is to name your three most important selection criteria before reviewing any program, then run each shortlisted option against those criteria rather than against a generic "best" list. If you are unsure where to start, EdReports ratings and a structured 30-day pilot with teacher feedback will generate more locally useful evidence than any published ranking. For technology and AI-powered tools, match the tool's role to a specific instructional gap, verify privacy documentation before piloting at scale, and build integration into existing workflows rather than adding a parallel system. That sequence — criteria first, local pilot second, scale with documentation third — is the most reliable path to a defensible, durable adoption.

Keep reading

Best Math Curriculum for Dyslexia: How to Choose What Really Fits Your Learner

Use a dyslexia-informed checklist and practical criteria to select math curricula that align with learners’ cognitive profiles, instructional capacity, and multisensory needs for.

Best AI Tools for Teacher Productivity: A Workflow‑First Guide

Explore a workflow-first approach to selecting AI tools that streamline grading, communication, and material creation while ensuring privacy, integration, and sustained teacher.

Assessment tools for teachers: a practical decision guide

Assessment tools for teachers: a guide to selecting tools that meet privacy, accessibility, device, interoperability, and analytics needs to support effective, data-informed.