DDR2 Fitter(Place & Route) report errors

November 5, 2017, 7:46 pm

≫ Next: LEDs in interleaving method

≪ Previous: M20K RAM block usage question

The FPGA is Cyclone IV EP4CE30F23C7, FBGA484-7
And ddr2 parameters are as follow:

2.jpg
It reports as follows:

Can anyone help me to solve this problem, thanks

Attached Images

1.jpg (174.8 KB)
2.jpg (197.7 KB)
3.jpg (293.8 KB)

↧

LEDs in interleaving method

November 5, 2017, 10:15 pm

≫ Next: Max+Plus II with parallel T-Guar, missing license file

≪ Previous: DDR2 Fitter(Place & Route) report errors

Hello,
my name is Mustapa Hussainar and i need help in generating a program that can on and off two or more LEDs in interleaving method. im a beginner in VHDL and FPGA and its important for me to learn this things. For now i have done single LED blinking project using clock divider, comparator and lpm counter. therefore i think the next step is to learn how to generate interleave pwm switching to turn on and off the LED. im using Quartus II software and an Altera DE2-70. Hope you guys can help me with the logic needed to generate multiple pwm signals that uses the same duty cycle but out of phase with each other.

Thank you

Regards,
Mustapa Hussainar

↧

Max+Plus II with parallel T-Guar, missing license file

November 5, 2017, 11:31 pm

≫ Next: How NIOS gets reconnection of Simple Socket Server

≪ Previous: LEDs in interleaving method

Hello first.
We made years ago a FLEX8000-Design with Max+Plus II. Now we need to do a slight change in the design, but we can not find our corresponding license.dat in our inventory. With the Baseline-Version we are not able to edit FLEX-8000, neither with the Quartus-Version .
Only the Dongle is here.

Question: Can we edit the license-file for the Baseline-Version with the informations for the dongle?
Maybe some of you had the same problem.

Thanks for every help or suggestions.

Urs from Sersa

↧

How NIOS gets reconnection of Simple Socket Server

November 6, 2017, 1:58 am

≫ Next: Config Flash EPCQ4A Cylone II

≪ Previous: Max+Plus II with parallel T-Guar, missing license file

I am working on Simple Socket Server with Max10 Dev., everything works fine except:

When a connection established, if the client sends a 'q', then the connection can be reconnected again afterward.
But if I closed/or click "disconnection" button(without sending 'q') of my client, then re-connection is not possible anymore.

Could you tell me please how to fix this problem/challenge ?
Thanks in advance!!

↧

Config Flash EPCQ4A Cylone II

November 6, 2017, 2:56 am

≫ Next: PDN1708: Alternative Configuration Devices

≪ Previous: How NIOS gets reconnection of Simple Socket Server

Hello there

I want to replace the configflash EPCS4SI8N with the EPCQ4ASI8A. The flash is used to configure
a Cyclone II chip. In the product discontinuance notification is written, that the support for these devices will be included in version 17.1 of Quartus Prime.
But Cyclone II is only suppurted till version 13.1.
Is there any workaround that i can program a cyclone II Chip with the new configflash?

Thanks for quick help

Greets

↧

PDN1708: Alternative Configuration Devices

November 6, 2017, 3:05 am

≫ Next: How can i find the IP address of DE 10 nano kit fpga board

≪ Previous: Config Flash EPCQ4A Cylone II

Hello,

Within the PDN1708 (https://www.altera.com/content/dam/a...cn/pdn1708.pdf), it is said that alternative configuration devices will have support in version 17.1 of Quartus Prime.

When I read the "what's new" page of version 17.1 (https://www.altera.com/products/desi...hat-s-new.html), there is no mention of these devices.

Where can I get this information please?

Thank you

Damien

↧

How can i find the IP address of DE 10 nano kit fpga board

November 6, 2017, 7:47 am

≫ Next: NDRnage Kernels Global Memory Write Pattern

≪ Previous: PDN1708: Alternative Configuration Devices

How can i find the IP address of DE 10 nano kit fpga board?
I am using LINUX OS. please help me out.
Thanks.

↧

NDRnage Kernels Global Memory Write Pattern

November 6, 2017, 8:25 am

≫ Next: Interfacing uOLED Display to Cyclone V HPS

≪ Previous: How can i find the IP address of DE 10 nano kit fpga board

I came up with some basic questions, which I would like to discuss:

1) I still don't understand how SIMD is being implemented in FPGA. In GPU, workitems being assigned to SIMDs are being executed concurrently. Is it the same case for FPGA? Or they are just going to be interleaved? For example in case of SIMD 16, Does 16 workitems being scheduled to a compute unit and being executed in interleaved fashion?

2) In case SIMD in FPGA is not as parallel as GPU, does 16 workitems being scheduled at once to a compute unit and wait to finish? or the next work items can still come in and be pushed into the pipeline?

3) Imagine a case where every work-item only writes one value at the end of execution into the global memory, and it writes it to index of "GlobalID" of that work item. In case of having many compute units and having SIMD of 16, at each clock cycle many write operations will be issued with non-continuous addresses (Based on my understanding). This seems to be inefficient with regards to high performance memory access. Does that mean, kernels designed for GPU are not suitable for FPGA, with regards to their memory access pattern?

4) Does LSU (Load Store Unit) performs memory coalescing? In other words, does it have any kind of buffer to receive memory write operations, and then flush them into the memory after grouping them into multiple continuous blocks of data?

↧

Interfacing uOLED Display to Cyclone V HPS

November 6, 2017, 9:04 am

≫ Next: Simultaneous Read and Write to DDR3 SDRAM

≪ Previous: NDRnage Kernels Global Memory Write Pattern

Hi,

I am new to Programming. I have to write a code to interface an uOLED Display int the HPS of Cyclone V. I have know idea how to go about it. Please share some data or video regarding this.

Regards
Kba

↧

Simultaneous Read and Write to DDR3 SDRAM

November 6, 2017, 10:12 am

≫ Next: Arria 10 IOPLL output not synchronous with input

≪ Previous: Interfacing uOLED Display to Cyclone V HPS

Simultaneous read and write on DDR3 SDRAM using soft SDRAM Controller with UiPHY IP via Qsys Interconnect.

An AXI-MM master is connected to Qsys Interconnect to perform read and write operation on SDRAM slave,

Cyclone V is the FPGA device, with PL only system.
Read is initiated first, with read length- 512, Write is initiated later say after 20 cycles of delay from Read operation. with a length 512.

Read address locations does not overlap with that of write.

This specific operation fails as Interconnect drops the wr_ready signal halting the operation.

While The same worked well using Xilinx Interconnect on a Xilinx Device. (independent read and write on Cyclone 5 was successful but the operation demands a simultaneous R/W)

Is there any Interconnect requirements or bridges that I would permit simultaneous read and write?.

Thanks for help

↧

Arria 10 IOPLL output not synchronous with input

November 6, 2017, 1:25 pm

≫ Next: UART using LOAN on DE10 Nano

≪ Previous: Simultaneous Read and Write to DDR3 SDRAM

I'm using an IOPLL on the Arria10. I really need a 2x synchronous clock with my input, but i cannot get any synchronous output (well sometimes I can but its rare). In the attachments, this is the same test points, the blue clock is my IOPLL input clock, the yellow is the 2X output. What you see in the attachments is three different JTAG loads of my project, I get a different alignment each time.

I've tried Normal, Source Synchronous, and direct modes, none of which consistently yield a synchronous output. I need those edges to align. Am I wrong in expecting them to be synchronous? I don't see any other PLL options in the Arria 10 library, all the other PLLs are dedicated to gigabit transceivers either in PCI, Ethernet or vanilla transceivers.

Thanks

Attached Images

↧

UART using LOAN on DE10 Nano

November 6, 2017, 1:31 pm

≫ Next: Book Suggestion

≪ Previous: Arria 10 IOPLL output not synchronous with input

hi everyone, i'm Ivan

i'm working with a de10 nano and i'm trying to communicate by UART to my computer using LOAN IO for use only this HPS's pheripheral. I've created the system in Qsys and it compiles in "Analisys & Synthesis"but in the "Fitter (Place & Route)" part i got a problem cause appear this message:

Error (21179): Pins memory_mem_ck and memory_mem_ck_n form a differential pair and uses pseudo-differential output node Uart_3_hps_0:hps_0|Uart_3_hps_0_hps_io:hps_io|Uart _3_hps_0_hps_io_border:border|hps_sdram:hps_sdram_ inst|hps_sdram_p0:p0|hps_sdram_p0_acv_hard_memphy: umemphy|hps_sdram_p0_acv_hard_io_pads:uio_pads|hps _sdram_p0_acv_hard_addr_cmd_pads:uaddr_cmd_pads|hp s_sdram_p0_clock_pair_generator:clock_gen[0].uclk_generator|pseudo_diffa_0. However, these pins also have an I/O standard 2.5 V that cannot be supported by the pseudo-differential output node.

what i have to do to can compile?
Somebody can help me please?
Thanks

↧

Book Suggestion

November 6, 2017, 3:24 pm

≫ Next: Controlling NDRange kernel M20K RAM replication

≪ Previous: UART using LOAN on DE10 Nano

Hello, I am still a newbie concerning TimeQuest and writing SDC timing scripts. I found a book "Constraining Designs for Synthesis and Timing Analysis: A Practical Guide to Synopsys Design Constraints" ISBN-13: 978-1461432685 . I was wondering if anyone can comment on this book?

Thanks,
Joe

↧

Controlling NDRange kernel M20K RAM replication

November 6, 2017, 4:56 pm

≫ Next: Error (114016): Out of memory in module quartus_fit.exe

≪ Previous: Book Suggestion

Hello all,

Is there a way to limit the number of work groups executing simultaneously in a compute unit (NDRange kernel)? I have an issue where the compiler is replicating RAM so much that my RAM resource usage is way over 100% solely due to its crazy replication scheme (30x!). Each work item has it's own private memory so it's not replicating it for banking purposes, and the compiler reports the replication is to "efficiently support simultaneous workgroups". Does the compiler really prioritize performance over being able to build the kernel at all??

I saw an excellent post earlier about single work item kernels and #pragma max_concurrency which I didn't know about; and am hoping there's something similar for NDRange kernels.

Thanks in advance.

↧

Error (114016): Out of memory in module quartus_fit.exe

November 6, 2017, 5:55 pm

≫ Next: Data is read before accumulation is finished

≪ Previous: Controlling NDRange kernel M20K RAM replication

i compile my project today,quartus 13.0sp1 32bit show me a error:"Error (114016): Out of memory in module quartus_fit.exe (399 megabytes used)"
the device i used is EP3C16Q240C8
my computer cofiguration is WIN7 professional 32bit,the RAM is 3.43GB available
it is strange,because i have compile this project about a month ago which is OK
during this month ,the project and my computer do not have any change
i am confused,does someone can give me some advise?

↧

Data is read before accumulation is finished

November 6, 2017, 11:23 pm

≫ Next: Easy examples to use the hps-2-fpga bridge (not the lightweight one)

≪ Previous: Error (114016): Out of memory in module quartus_fit.exe

================================================== =============
for(uint t = 0; t < loop_cnt; t++) {
//load data to data buffer
for(uint w = 0; w < TILE_WIDTH; w++) {
data[w] = read_channel_altera(data_in_ch);
}

for(uint h = 0; h < TILE_HEIGHT; h++) {
weight[h] = read_channel_altera(weight_in_ch);
}

//comput the matrix tile multiplication using the PE(mac) array
#pragma unroll
for(uint w = 0; w < TILE_WIDTH; w++) {
float data_temp = data[w];
#pragma unroll
for(uint h = 0; h < TILE_HEIGHT; h++) {
float weight_temp = weight[h];
float temp = data_temp * weight_temp;
if(t == 0)
output[h * TILE_WIDTH + w] = temp;
else
output[h * TILE_WIDTH + w] = output[h * TILE_WIDTH + w] + temp;
}
}

}
//declare output data to be enqueued in altara channel
lane output_lane;

for(uint w = 0; w < TILE_WIDTH; w++) {
#pragma unroll
for(uint h = 0; h < TILE_HEIGHT; h++) {
//multiply with scale and plus bias before moving it out
output_lane.lane_data[h] = output[h * TILE_WIDTH + w] * scale[h] + bias[h];
}

write_channel_altera(output_ch, output_lane);

}
================================================== ======================================

Here is a snippet of my code. Basically what I am doing is doing matrix multiplication and move the data out by channel if the accumulation is finished. But according to the hardware run, the output is not fully accumulated (it's moved out before the accumulation is finished, for example, if the correct output pattern is all 36, the hardware run result would be a mix of values smaller than 36). And the compilation report seems to support this (with TILE_WIDTH 4 and TILE_HEIGHT 8, the number of simultaneous reads to output local buffer should be 32, but in the report it's 40, which is because after accumulation I have 8 simultaneous reads to move the data out (32 + 8 = 40). So it looks like the accumulation and moving out is happening at the same time!! This is very weird because moving out should happen after accumulation is finished.
below is the report of local buffer output

================================================== =========================================

Local memory: Optimal. Requested size 128 bytes (rounded up to nearest power of 2), implemented size 128 bytes, stall-free, 40 reads and 32 writes. Additional information: - Banked on lowest dimension into 32 separate banks (this is a good thing). - Reducing accesses to exactly one read and one write for all on-chip memory systems may increase overall system performance.

================================================== ========================================

And advice would be greatly appreciated!!

↧

Easy examples to use the hps-2-fpga bridge (not the lightweight one)

November 7, 2017, 12:52 am

≫ Next: False path question - internal counter bit vs FSM state bit

≪ Previous: Data is read before accumulation is finished

Hi,

I'm looking for basic examples and tutorials of hps-2-fpga bridge implementation, ideally short, showing all config that should be done for the bridge to work.

My board is de0-nano-SoC, cyclone V.

I looked at : https://www.altera.com/support/suppo...n-example.html but... too complex for my level now.

Any example welcome, I know it is often needed to browse several examples to get all information needed.

Thanks a lot !
Franck.

Background of the request:

After many evening of work, I succeeded to get my DE0-nano-SOC boot and have a first communication with lw-hps-2-fpga bridge. Great !

Also, found packages to get image reading/processing in python. Works well also.

End goal is to drive a 64x32 led matrix, and be able to display jpeg images on it.

Now I would like to test/use/evaluate the hps-2-fpga bridge (not the lightweight one) to transfer images from hps to fpga.

↧

False path question - internal counter bit vs FSM state bit

November 7, 2017, 3:38 am

≫ Next: Issue With MegaWizard for generation DDR2 SDRAM Controller with ALTMEMPHY 13.1

≪ Previous: Easy examples to use the hps-2-fpga bridge (not the lightweight one)

Dear Altera Forum members...
This is my first post here :)
I have a sub-circuit that consists of a binary up-counter (LPM_counter) with asynchronous reset and count enable, and a comparator (LPM_compare).
The sub-circuit compares an input 5-bit vector to the count value and sets an output flag if the count value is greater-or-equal to the input data value.
This flag is used as a control bit to a finite state machine.

When running the TimeQuest timing analyzer tool in Quartus, I get setup timing violations with worst-case timing paths that involve the q[1]-bit and states of the FSM. My question is whether or not I can set these as false paths

The figure shows how the TimeQuest labels the nodes that violate the setup timing.
I was thinking that since the timing violations does not involve the flag used to control the state transitions, I can set the shown paths as false paths... Please comment.

Regards Ronny

Attached Images

temp.jpg (19.7 KB)

↧

Issue With MegaWizard for generation DDR2 SDRAM Controller with ALTMEMPHY 13.1

November 7, 2017, 3:40 am

≫ Next: About booting an Image

≪ Previous: False path question - internal counter bit vs FSM state bit

I have Windows-7 and Quartus 13.1 installed on my machine and also having Parmanent License for Quartus.

while generating the DDR2 SDRAM Controller with ALTMEMPHY my MegaWizard always STUCK at "Generating the functional Simulation model" and not go beyond that.

I have Select following Configuration for this IP Core.

PLL Frequency : 125 MHz
Memory Frequency : 166.667Mhz
Speed Grade : 7

*As i have Cyclone-III ESDK Board and board having Microne MT47H32M16CC-3 device so select this memory from MegaWizard Memory menu

I have Installed /reinstalled Quartus Several times but wont go beyond this point (Attached)

Any Suggestion in this regard is very helpfull.

k

Attached Images

Error.jpg (223.2 KB)

↧

About booting an Image

November 7, 2017, 5:23 am

≫ Next: 10G Example design simulation tcl error

≪ Previous: Issue With MegaWizard for generation DDR2 SDRAM Controller with ALTMEMPHY 13.1

How to generate preloader ?

Where can we get BSP editor?

Could someone eplease explain about it?

thanks

↧