Making decisions based on noisy evidence requires two distinct computations - accumulation of evidence followed by its categorization to reach a decision. How are these sequential computations performed by recurrently connected brain areas? In rats, these two theoretically defined computations have been mapped onto two interconnected area - anterodorsal striatum (ADS) and frontal orienting fields (FOF) respectively. This provides an intriguing - yet untested - neural implementation of the decision process in which ADS and FOF form a feedforward functional hierarchy. Here, we present results from simultaneous recordings and projection-specific inactivations that challenge this modular, feedforward hypothesis. Contrary to this hypothesis, we found that ADS and FOF are indistinguishable in their timescales and strength of encoding and decoding of the accumulator value. Further, accumulator estimates from the two areas show no discernible lead-lag relationship and their communication subspace carries evidence information, consistent with bi-directional sharing of accumulation related signals. Moreover, such communication is necessary for decision-making, as optogenetic silencing of FOF inputs to ADS impaired decisions throughout the accumulation period, revealing a role for FOF in evidence accumulation that was obscured during nonselective whole-region silencing. Together, our results indicate that the FOF-ADS circuitry participates in evidence accumulation in a distributed, recurrent manner unlike previously hypothesized. We model this distributed implementation using a multi-region recurrent neural network (RNN) with biological constraints on connectivity. We leveraged perturbation data to introduce regional specialization into the RNN,by training it to capture behavior on perturbation and control trials. The trained model reproduces empirical representational redundancies and replicates effects of novel (untrained) perturbations. We show that the models dynamics recover during FOF perturbations, facilitated by recurrent inputs into FOF from ADS, but not during perturbations of its ADS projections. Our novel combination of projection-specific perturbations and RNN modeling offers a promising approach for understanding complex multi-region implementations of simple computations.