
CPU, graphics cards, FPGA, multi PICs/Arduino speed

Andy
Mon Nov 25 2013, 06:40AM
Andy Registered Member #4266 Joined: Fri Dec 16 2011, 03:15AM
Location:
Posts: 874
Hi
I'm writing a program and would like to find the fastest platform for it. The code has if statements and a lot of branching, not many for loops, but a lot of basic blocks that are the same.

I was thinking of using OpenMP on the CPU, as I don't think the graphics card would speed up the code. Is an FPGA good, or do the branches slow it down?
What about 1000 PICs or Arduinos?

Thanks
Carbon_Rod
Mon Nov 25 2013, 07:27AM
Carbon_Rod Registered Member #65 Joined: Thu Feb 09 2006, 06:43AM
Location:
Posts: 1155
For some small platforms, people will dedicate FPGA space to kernel modules that solve specific problems efficiently in parallel.

OpenCL is part of the nVidia SDK, but whether it runs "faster" depends on the problem. A 24+ core multi-CPU machine will churn through data more quickly, given that it doesn't need to copy into GPU memory space. However, the same machine cannot match 400+ 1.8 GHz dedicated GPU vector cells running in parallel.
OpenMP is fine when the problem can be broken apart, and doesn't need set locality in the cluster host partition slice. But in other situations... it can run "slower" than even a single core.
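
The trade-off above can be put into a rough cost model. This is only an illustrative sketch, not a benchmark: the function names and every rate used with them are hypothetical, and it assumes the GPU pays a fixed launch cost plus a per-item copy over the bus before it can compute.

```python
def cpu_seconds(items: int, cpu_items_per_sec: float) -> float:
    """Time to process everything on the CPU; no copy step is needed."""
    return items / cpu_items_per_sec


def gpu_seconds(items: int, gpu_items_per_sec: float, bytes_per_item: int,
                bus_bytes_per_sec: float, launch_overhead_sec: float) -> float:
    """Time on the GPU: fixed launch cost, copy over the bus, then compute."""
    copy_time = items * bytes_per_item / bus_bytes_per_sec
    compute_time = items / gpu_items_per_sec
    return launch_overhead_sec + copy_time + compute_time


if __name__ == "__main__":
    # All numbers below are made up purely to show the shape of the trade-off.
    for items in (1_000, 10_000_000):
        cpu = cpu_seconds(items, cpu_items_per_sec=1e6)
        gpu = gpu_seconds(items, gpu_items_per_sec=2e7, bytes_per_item=64,
                          bus_bytes_per_sec=5e9, launch_overhead_sec=0.01)
        print(f"{items} items: CPU {cpu:.4f} s, GPU {gpu:.4f} s")
```

With these assumed rates the small job is quicker on the CPU because the fixed overhead dominates, while the large job is quicker on the GPU. Real results depend on measured rates for the actual code.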

Note it takes far less time to learn Intel's Threading Building Blocks libraries and proper algorithm design...
A good compiler will usually in-line small functions to exploit pipelining.

The "Cloud" demand has shifted technologies into a new class of design problem.
This code example is very helpful in learning about these new paradigms: Link2
;)

Andy
Tue Nov 26 2013, 01:35AM
Andy Registered Member #4266 Joined: Fri Dec 16 2011, 03:15AM
Location:
Posts: 874
I don't mind purchasing the hardware (would like to), but I don't want to send it into the cloud.
What info do you need?
Carbon_Rod
Tue Nov 26 2013, 09:06AM
Carbon_Rod Registered Member #65 Joined: Thu Feb 09 2006, 06:43AM
Location:
Posts: 1155
Describe big Theta, start here:
Link2

Read: Link2
Boost: Link2
STL: Link2


Andy
Tue Nov 26 2013, 11:37PM
Andy Registered Member #4266 Joined: Fri Dec 16 2011, 03:15AM
Location:
Posts: 874
Thanks Carbon_Rod
Got rid of most of the if statements and replaced them with a lookup table. Have you got any information on server racks? Do you set them up like a desktop, or do you think that a gaming rig would be a better option?
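
For readers unfamiliar with the change described here, a minimal sketch of swapping an if/elif chain for a lookup table follows. The codes and weights are hypothetical placeholders, since the thread does not show the actual program.

```python
def weight_with_branches(code: int) -> int:
    """Original style: one comparison per case, evaluated in order."""
    if code == 0:
        return 1
    elif code == 1:
        return 4
    elif code == 2:
        return 9
    elif code == 3:
        return 16
    else:
        return 0  # fallback for any unknown code


# Table style: the code itself is the index, so no chain of comparisons.
WEIGHT_TABLE = (1, 4, 9, 16)  # hypothetical values, indexed by code


def weight_with_table(code: int) -> int:
    """Same result as the branch version, using a single bounds check."""
    if 0 <= code < len(WEIGHT_TABLE):
        return WEIGHT_TABLE[code]
    return 0  # same fallback as the branch version


if __name__ == "__main__":
    for code in range(-1, 6):
        print(code, weight_with_branches(code), weight_with_table(code))
```

The table version keeps one bounds check so that out-of-range codes behave the same as the else branch. In compiled code the same idea removes most of the unpredictable branches from the hot path.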
Carbon_Rod
Wed Nov 27 2013, 06:59AM
Carbon_Rod Registered Member #65 Joined: Thu Feb 09 2006, 06:43AM
Location:
Posts: 1155
A reliable rack server costs a bit more as they use special parity-checked DRAM to detect errors, reliable Intel CPUs, and have several management options most people never encounter. If you already have a SAN, then the incremental cost of adding cores is minimal. There are also special GPU modules for this type of server, but they're not really useful for the task of serving files.

Desktops have some advantages as they are inexpensive if purchased used, and have more space for random parts. The low-end $50 GeForce GTX 295 is a great deal thanks to Microsoft, as they no longer really work for modern games and the Linux CUDA developer drivers are mature for these cards. Note, don't bother putting more than 1 GPU card in a machine, and use at least a “750” watt power supply.

Hosts with an older Intel quad run these cards just fine, as do the 24 core >i7 workstations...
Ignore modern sleaze-box labels, and use a benchmark cpu list when buying.
Link2
You will find a discrepancy between value and performance in modern retail outlets.

What problem are you trying to solve?
Link2
Andy
Wed Nov 27 2013, 07:51AM
Andy Registered Member #4266 Joined: Fri Dec 16 2011, 03:15AM
Location:
Posts: 874
It's a chemical-finder type of program, and I have to brute-force a lot of combinations. I checked out a desktop motherboard with four sockets that can handle 12-core AMD Opterons, but that and the RAM would set the price at 6 grand... so still dreaming. That would take 4 hours for 500 chemical combinations, and I'm hopefully looking at 100 million combinations...
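
To put those figures in perspective, here is a quick back-of-the-envelope calculation. It assumes the run time scales linearly with the number of combinations, which the thread does not confirm, and the function name is only illustrative.

```python
def scaled_hours(measured_hours: float, measured_count: int,
                 target_count: int) -> float:
    """Extrapolate run time, assuming cost per combination stays constant."""
    return measured_hours * target_count / measured_count


if __name__ == "__main__":
    # Figures from the post: 4 hours for 500 combinations, target 100 million.
    hours = scaled_hours(4, 500, 100_000_000)
    seconds_per_combination = 4 * 3600 / 500
    years = hours / (24 * 365)
    print(f"{seconds_per_combination} s per combination")   # 28.8
    print(f"{hours:.0f} hours, roughly {years:.0f} years")  # 800000 hours, roughly 91 years
```

Under that linear assumption, the full search is about 800,000 hours, so faster hardware alone would need a speedup of several orders of magnitude. Cutting the cost per combination matters at least as much.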

With graphics cards, do you get lower performance with if statements and for loops? I can unroll some of the for loops. Last time I tried to write a kernel for a graphics card it wasn't much faster than the CPU. Could you post or link to a good reference on how to program them?
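
On the question of branches: NVIDIA GPUs run threads in groups of 32 (warps), and when threads in one warp take different branches the paths are executed one after another. The toy model below sketches that effect in plain Python. It is a deliberate simplification with hypothetical path costs, and it ignores memory access and everything else that affects real kernels.

```python
WARP_SIZE = 32  # threads per warp on NVIDIA hardware


def warp_cost(branch_ids, path_cost):
    """One warp pays for every distinct path that any of its threads takes."""
    return sum(path_cost[branch] for branch in set(branch_ids))


def kernel_cost(branch_ids, path_cost, warp_size=WARP_SIZE):
    """Total cost over all warps, with threads grouped in index order."""
    total = 0
    for start in range(0, len(branch_ids), warp_size):
        total += warp_cost(branch_ids[start:start + warp_size], path_cost)
    return total


if __name__ == "__main__":
    path_cost = {0: 10, 1: 10}                 # hypothetical cost per branch
    mixed = [i % 2 for i in range(64)]         # neighbours take different paths
    grouped = sorted(mixed)                    # same work, sorted by path
    print(kernel_cost(mixed, path_cost))       # 40: both warps run both paths
    print(kernel_cost(grouped, path_cost))     # 20: each warp runs one path
```

In this model, sorting or grouping the work so that neighbouring threads take the same branch halves the cost. That is one reason a kernel with a lot of data-dependent branching may show little gain over a CPU.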

Thanks for your help
Shrad
Wed Nov 27 2013, 08:14AM
Shrad Registered Member #3215 Joined: Sun Sept 19 2010, 08:42PM
Location:
Posts: 780
Have a look at blade servers. They provide multiple CPUs and RAM in a small form factor, much smaller than the desktop multi-CPU units.
Andy
Wed Nov 27 2013, 10:15AM
Andy Registered Member #4266 Joined: Fri Dec 16 2011, 03:15AM
Location:
Posts: 874
There was an 8-core, single-socket option for $1000. I still haven't ruled out graphics cards or PICs; the code could easily fit on a 32-bit PIC. Maybe have each PIC with a 2 GB SD card, with another layer for the second-stage processing. $1000 would give 1k cores. Doable?

Shrad
Wed Nov 27 2013, 12:02PM
Shrad Registered Member #3215 Joined: Sun Sept 19 2010, 08:42PM
Location:
Posts: 780
You would have to add some RAM, or you will eat through the read/write cycles of your SD card pretty quickly.

Maybe with PlayStations or something similar? I read somewhere that you could install Unix on a PlayStation 2 or 3 or an Xbox, I don't remember which.
Legal Information
This site is powered by e107, which is released under the GNU GPL License. All work on this site, except where otherwise noted, is licensed under a Creative Commons Attribution-ShareAlike 2.5 License. By submitting any information to this site, you agree that anything submitted will be so licensed. Please read our Disclaimer and Policies page for information on your rights and responsibilities regarding this site.