Design Article

IMG1

How to implement DSP algorithms using the Xilinx Spartan-3E starter board

Dennis Silage

4/2/2008 12:34 PM EDT

Editor's Note: This "How To" tutorial is an excerpt from a recently published book: Embedded Design Using Programmable Gate Arrays by Dennis Silage (ISBN-13: 978-1589094864).

Dennis is a Professor in the department of Electrical and Computer Engineering in the College of Engineering at Temple University, Philadelphia, USA. This text is intended as a supplementary text and laboratory manual for undergraduate students in a contemporary course and for professionals who have not had an exposure to the FSM, the controller-datapath construct, and the Xilinx design environment.

The book provides complete Xilinx ISE FPGA projects in Verilog using finite state machines (FSM), controller-datapath modules, and Xilinx LogiCORE blocks for real-time processing in DSP, digital communications, and digital control on the Xilinx Spartan-3E Starter Board.

Also, for those who always ask, the author informed me we can be reassured that: "All of the code in this text has been 'battle tested' by execution on the evaluation board." Having run into problems with the code provided in other books in the past, this is certainly good to know.


Introduction
Embedded design in Verilog using FPGAs can utilize controller and datapath modules to facilitate the implementation of real-time tasks. The controller module accepts external control and status signals from the datapath module and uses one or more finite state machines (FSM) to coordinate the process. The controller module provides the datapath module control input signals that route the input data, perform processing and output the data. The datapath module stores and manipulates data in registers using combinational and sequential logic and can use one or more FSMs to output the data but not autonomously. The controller can also accept external control signals from and return status signals to an external processor or internal soft core processor to augment the performance of the embedded system.

The controller and datapath construct partitions the design into modules that can be separately verified in simulation. Rather than one module that encapsulates the entire process, the controller and datapath modules then each have a reduced number of interconnections which facilitates the Verilog structural and behavioral synthesis into FPGA hardware. Datapath modules also support the concept of design reuse.

The controller module can be easily modified to accommodate a new task, which can then even include additional datapath modules. The configuration of a typical controller and datapath construct is shown in Fig 1. The synchronous clock input schedules the state transitions of the FSMs of the controller and datapath. Registers can be initialized by a global reset signal, a local reset signal or by a declaration in the behavioral synthesis of the controller and datapath. The reset signal is not shown in Fig 1.


1. Configuration of a typical controller and datapath construct.

The controller has control input logic signals that initiate the process and status output logic signals that signify the completion of the process. The datapath has only data as an input and output and no external process control logic signals other than those derived from the controller. The datapath outputs status logic signals to the controller to coordinate the process. A clock signal is used to provisionally evoke a state transition in the FSM if utilized in the controller and datapath. The controller control signals and the datapath status signals are required to have a state transition.

DSP System
The DSP system consisting of an ADC, FPGA and DAC is shown in Fig 2. The ADC provides n-bit data to the FPGA and receives an a-bit data packet for command and control. The DAC receives both m-bit data and a d-bit data packet for command and control from the FPGA. A crystal oscillator provides a clock signal to the FPGA for synchronization and timing of the data transfers and to establish the sampling rate fs of the DSP system.


2. DSP embedded hardware system.

This DSP system executes on the Xilinx Spartan-3E Starter Board using Verilog structural and behavioral synthesis modules that are developed using the Xilinx ISE tools. The Verilog modules are configured as FSMs and the controller and datapath construct. The DSP system is initially implemented as a straight-through processor that inputs and outputs an analog signal without any manipulation to assess the maximum data throughput rate with the Verilog top module s3eadcdac.v in Listing 1. The five Verilog modules operate in parallel in the top module.


Listing 1. ADC-DAC system top module for the
Xilinx Spartan-3E starter board.
(Click this image to view a larger, more detailed version)

The Verilog top module utilizes the se3adc.v and se3progamp.v modules for the ADC and programmable gain amplifier (PGA) and the se3dac.v and dacs3edcm.v modules for the DAC and the Digital Clock Manager (DCM) of the Xilinx Spartan-3E Starter Board. The DCM is an available Xilinx Architecture Wizard module which facilitates the embedded design. These four Verilog modules are FSM controllers for the ADC, PGA and DAC external peripherals and the soft-core peripheral DCM and are described in the text.

An output pin of one of the 6-pin peripheral ports is used to monitor the ADC conversion command signal (conad) and the sampling rate fs = approximately 282 ksamples/sec here. This maximum sampling rate is far below the cutoff frequency of the anti-aliasing filter of the Spartan-3E Starter Board ADC (1.54 MHz). To avoid aliasing, an analog filter with a cutoff frequency fcutoff < fs/2 or fcutoff < 141 kHz would have been appropriate.

Although data from the two ADC channels are available simultaneously, the DAC can only accept one channel of data at this sampling rate. In addition, the Xilinx Spartan-3E Starter Board DAC shares access to the serial peripheral interface (SPI) bus with the ADC (and other external peripherals) which leads to SPI bus contention and an even lower sampling rate.

The Xilinx Spartan-3E Starter Board ADC outputs 14-bit two's complement binary data and the DAC requires 12-bit straight binary data. The conversion is accomplished by complementing the MSB of the ADC output (bit 13) as the sign bit. The twelve MSBs of the ADC output (bit 13 through bit 2) are outputted to the DAC. Since the PGA inverts the analog input signal, the resulting straight binary data is complemented for comparison of the analog input and output signals on an oscilloscope. The module genadcdac.v in the top module is a three state FSM controller for the ADC-DAC system. The ADC 14-bit signed 2's complement data is converted to a 12-bit straight binary data for the DAC. Slide switch SW0 selects one of the two available ADC channels. The DCM frequency synthesizer module dacs3edcm.v inputs the 50 MHz crystal clock oscillator and outputs a 50 MHz × 5/3 = 83.333 MHz clock for the DAC. The DAC has a maximum SPI bus rate of 50 Mb/sec and is running at 83.333/2 = 41.666 Mb/sec from the controller.


Listing 2. ADC-DAC system controller module for the
Xilinx Spartan-3E starter board.
(Click this image to view a larger, more detailed version)

For comparison, a second straight-through ADC-DAC system utilizes the Digilent PmodAD1 ADC and Digilent PmodDA2 DAC and is described in the text. The PmodAD1 and the PmodDA2 hardware modules are connected to two of the 6-pin peripheral module ports of the Xilinx Spartan-3E Starter Board. An output pin of another of the 6-pin ports is again used to monitor the ADC conversion command signal and the sampling rate is fs = approximately 714 ksamples/sec. This maximum sampling rate is above the cutoff frequency of the anti-aliasing filter of the Digilent PmodAD1 ADC (500 kHz). To avoid aliasing, an analog filter with a cutoff frequency fcutoff < fs/2 or fcutoff < 357 kHz would be appropriate.

Although data from the two channels of the PmodAD1 ADC are available simultaneously, the PmodDA1 DAC can only accept one channel of data at this sampling rate. However, unlike the ADC and DAC of the Spartan-3E Starter Board, the PmodAD1 and PmodDA2 do not share an SPI data communication channel and can be processed in parallel. The output data of the ADC and the input data of the DAC are both 12-bit and utilize a straight binary data representation and no data conversion is thus required.

The PmodAD1 ADC and PmodDA2 DAC status signals from their datapath modules are processed in parallel by the controller and both provide a logic 1 signal to indicate that processing has completed. The DAC is outputting the previous analog signal sampled by the ADC while the ADC is obtaining the current analog signal sample. This illustrates wavefront processing and results in a sampling rate 2.5 times that of the straight-through ADC-DAC system using the Xilinx Spartan-3E Starter Board ADC and DAC. This primarily occurs because there is no SPI bus contention.

Xilinx FIR Compiler LogiCORE Block
The straight-through ADC-DAC systems implemented with the controller and datapath modules can be easily modified to perform a DSP function. The Xilinx CORE Generator provides a FIR Compiler LogiCORE block which is a common interface for the design of FPGA resource efficient FIR digital filters with either multiply-accumulate (MAC) or distributed arithmetic (DA) architectures. The FIR Compiler LogiCORE block is evoked from within the Xilinx ISE project and provides design windows as described in the text.

The Xilinx LogiCORE block FIR Compiler provides from 2 to 1024 taps and integer signed or unsigned coefficients that range from 1-bit to 32-bit precision and is configured as a datapath module in Verilog. The signed or unsigned integer data widths can be as large as 32-bit and are accommodated by accumulator widths up to 74-bits.

The LogiCORE FIR Compiler here uses a coefficient file and implements a single-rate FIR MAC digital filter with one data channel (although up to 256 channels are possible), a clock frequency of 50 MHz and a sampling rate of 250 kHz. The clock and sample frequency parameters and the input and output bus widths specified are those utilized with the Spartan-3E Starter Board and the integral ADC and DAC. The FIR Compiler has the capability to reload coefficients on a separate data bus, change the digital filter function with an external command and utilize different coefficient sets for each data channel.

The FIR Compiler digital filter is implemented as part of a DSP system. A portion of the Verilog top module s3eadcfirdac.v is given in Listing 3. The complete module is similar to that of the top module in Listing 1. Additional details of the implementation of the DSP system are provided in the text. The six Verilog modules operate in parallel in the top module. An output pin of a 6-pin port is again used to monitor the ADC conversion command signal (conad). The estimated cycle latency of the FIR filter from the FIR Compiler LogiCORE block design windows is 16 × Tclock = 0.32 µsec since fclock = 50 MHz.

The actual latency of the digital filter can be determined by measuring the time interval between the assertion of the new data input signal (nd) and the appearance of the data ready output signal (rdy) of the FIR Compiler datapath module. The latency is measured to be 0.39 µsec, which is less than the available in-sequence computation time of 0.46 µsec, since the sampling period Ts = 4 µsec and the sampling period for the straight-through or ADC-DAC system in Listing 1 is TADC-DAC = approximately 3.54 µsec (282 ksamples/sec).

The controller module genadcfirdac.v utilizes a five state FSM controller for the FIR Compiler LogiCORE block DSP system. The FIR filter LogiCORE block datapath ready for data signal (rfd) sets the sampling rate of 250 ksamples/sec. The controller module responses with the FIR filter LogiCORE block new input data available signal (nd). The data ready status signal (rdy) indicates that the 25-bit FIR digital filter data (firdout) is available.


Listing 3. FIR Compiler LogiCORE block DSP system top module for the Xilinx Spartan-3E Starter Board (portion).
(Click this image to view a larger, more detailed version)

Xilinx PicoBlaze soft core processor
The availability of the soft core processor within the FPGA has significantly altered the ambiance of embedded design. Although hard core processors are available for some FPGAs, the soft core processor seemingly has more flexibility including the capability of utilizing several within the embedded design. Since the soft core processor can be modified easily, new and specialized architectures can also be readily developed.

The Xilinx 8-bit PicoBlaze soft core processor is quite resource efficient and is suitable for debouncing and reading slide switches and push buttons and external peripherals with a low data throughput, such as the LCD of the Spartan-3E Starter Board. The Verilog controller and datapath construct is seemingly more suited to high throughput internal soft core peripherals and external peripherals, such as an ADC, the FIR Compiler LogiCORE block and a DAC in a DSP system. However, the Xilinx PicoBlaze soft core processor and the Verilog controller and datapath construct can also interact synergistically through port signals. An embedded design then would consist of processing elements appropriate for the task. Summary
This short description illustrates the concept and utility of the FSM and controller-datapath Verilog modules for real-time processing in DSP on the Spartan-3E Starter Board. The text Embedded Design Using Programmable Gate Arrays features complete Xilinx ISE projects for the PS/2 mouse and keyboard, PGA, ADC, DAC of the Xilinx Spartan-3E Starter Board and reference material for the Verilog hardware description language, the Xilinx EDA, DSP, digital communications, digital control and the Xilinx PicoBlaze soft core processor.

This 320 page, soft cover text is available from Bookstand Publishing. These complete Xilinx ISE project files for the Xilinx Spartan-3E Starter Board and the Digilent Basys Board are available for download (password protected) as described in the text.

Chapter 1: Verilog Hardware Description Language
      Programmable Logic Devices
      Hardware Description Languages
      Verilog Syntax and Concepts
      Structural Models in Verilog
      Behavioral Models in Verilog
      Finite State Machines
      Controller-Datapath Construct
      C to Verilog Translation
      Arithmetic Functions
      Programmable Gate Array and Microprocessor Comparison

Chapter 2: Verilog Design Automation
      Xilinx ISE WebPACK
      Xilinx CORE Generator
      Xilinx Floorplanner
      Xilinx Simulator
      Xilinx Verilog Language Templates
      Xilinx Architecture Wizard
      Xilinx LogiCORE Blocks
      Warnings and Errors in Synthesis

Chapter 3: Programmable Gate Array Hardware
      Evaluation Boards
      Selection of an Evaluation Board
      User Constraints File
      Hardware Components and Peripherals
            Crystal Clock Oscillator
            Light Emitting Diodes
            Push Buttons and Slide Switches
            Rotary Shaft Encoder
            Seven Segment Display
            Liquid Crystal Display
            PS/2 Keyboard Port
            PS/2 Mouse Port
            Digital-to-Analog Converter
            Analog-to-Digital Converter
            Auxiliary Ports and Peripherals

Chapter 4: Digital Signal Processing, Communications and Control
      Sampling and Quantization
      Discrete Time Sequences
      Discrete Frequency Response
      Analog Output
      Digital Signal Processing Embedded System
      IIR Digital Filter
      FIR Digital Filter
      FIR Compiler LogiCORE Block
            Digital Signal Processing System
      Sine-Cosine Look-Up Table LogiCORE Block
            DTMF Generator
      Direct Digital Synthesis Compiler LogiCORE Block
            Frequency Generator
            Frequency Shift Keying Modulator
            Phase Shift Keying Modulator
            Quaternary Phase Shift Keying Modulator
      Linear Finite Shift Register
      Data Communication
            RS-232 Standard
            Manchester Encoder-Decoder
      Digital Control
            Pulse Width Modulation
            DC Servomotor Speed Control

Chapter 5: Embedded Soft Core Processors
      Programmable Gate Array Processors
      Xilinx PicoBlaze Development Tools
      Xilinx PicoBlaze Processor Architecture
      Xilinx PicoBlaze Reference Projects
      Soft Core Processors and Peripherals


print

email

rss

Bookmark and Share

Joinpost comment




Please sign in to post comment

Navigate to related information

Most Popular

Product Parts Search

Enter part number or keyword
PartsSearch


FeedbackForm