If you need assistance, please send an email to forum at 4hv dot org. To ensure your email is not marked as spam, please include the phrase "4hv help" in the subject line. You can also find assistance via IRC, at irc.shadowworld.net, room #hvcomm.
Support 4hv.org!
Donate:
4hv.org is hosted on a dedicated server. Unfortunately, this server costs and we rely on the help of site members to keep 4hv.org running. Please consider donating. We will place your name on the thanks list and you'll be helping to keep 4hv.org alive and free for everyone. Members whose names appear in red bold have donated recently. Green bold denotes those who have recently donated to keep the server carbon neutral.
Special Thanks To:
Aaron Holmes
Aaron Wheeler
Adam Horden
Alan Scrimgeour
Andre
Andrew Haynes
Anonymous000
asabase
Austin Weil
barney
Barry
Bert Hickman
Bill Kukowski
Blitzorn
Brandon Paradelas
Bruce Bowling
BubeeMike
Byong Park
Cesiumsponge
Chris F.
Chris Hooper
Corey Worthington
Derek Woodroffe
Dalus
Dan Strother
Daniel Davis
Daniel Uhrenholt
datasheetarchive
Dave Billington
Dave Marshall
David F.
Dennis Rogers
drelectrix
Dr. John Gudenas
Dr. Spark
E.TexasTesla
eastvoltresearch
Eirik Taylor
Erik Dyakov
Erlend^SE
Finn Hammer
Firebug24k
GalliumMan
Gary Peterson
George Slade
GhostNull
Gordon Mcknight
Graham Armitage
Grant
GreySoul
Henry H
IamSmooth
In memory of Leo Powning
Jacob Cash
James Howells
James Pawson
Jeff Greenfield
Jeff Thomas
Jesse Frost
Jim Mitchell
jlr134
Joe Mastroianni
John Forcina
John Oberg
John Willcutt
Jon Newcomb
klugesmith
Leslie Wright
Lutz Hoffman
Mads Barnkob
Martin King
Mats Karlsson
Matt Gibson
Matthew Guidry
mbd
Michael D'Angelo
Mikkel
mileswaldron
mister_rf
Neil Foster
Nick de Smith
Nick Soroka
nicklenorp
Nik
Norman Stanley
Patrick Coleman
Paul Brodie
Paul Jordan
Paul Montgomery
Ped
Peter Krogen
Peter Terren
PhilGood
Richard Feldman
Robert Bush
Royce Bailey
Scott Fusare
Scott Newman
smiffy
Stella
Steven Busic
Steve Conner
Steve Jones
Steve Ward
Sulaiman
Thomas Coyle
Thomas A. Wallace
Thomas W
Timo
Torch
Ulf Jonsson
vasil
Vaxian
vladi mazzilli
wastehl
Weston
William Kim
William N.
William Stehl
Wesley Venis
The aforementioned have contributed financially to the continuing triumph of 4hv.org. They are deserving of my most heartfelt thanks.
Registered Member #4266
Joined: Fri Dec 16 2011, 03:15AM
Location:
Posts: 874
Hi I'm writing a program and would like to find the fast platform. The code has if statement and alot of branching, not many for loops, but alot of basic blocks that are the same.
Was thinking of using omp on the cpu, as I don't think the graphics card would speed up the code?, is fpga good or do the branch's slow it down? What about 1000 pics or audino?
Registered Member #65
Joined: Thu Feb 09 2006, 06:43AM
Location:
Posts: 1155
For some small platforms, people will dedicate FPGA space to kernel modules that solve specific problems efficiently in parallel.
OpenCL is part of the nVidia SDK, but whether it runs "faster" depends on the problem. A 24+ core multi-cpu machine will churn through data more quickly given it doesn't need to copy into GPU memory space. However, the same machine can not match 400+ 1.8 GHz dedicated gpu vector cells running in parallel. OpenMP is fine when the problem can be broken apart, and doesn't need set locality in the cluster host partition slice. But in other situations... it can run "slower" than even a single core.
Note it takes far less time to learn Intel's Threading Building Blocks libraries and proper algorithm design... A good compiler will usually in-line small functions to exploit pipelining.
The "Cloud" demand has shifted technologies into a new class of design problem. This code example is very helpful in learning about these new paradigms:
Registered Member #4266
Joined: Fri Dec 16 2011, 03:15AM
Location:
Posts: 874
Thanks Carbon_Rod Got rid of most of the if statements and replaced it with a lookup table. Have you got any information on severs racks, do you set them up like a desktop? or do you think that a gaming rig would be a better option.
Registered Member #65
Joined: Thu Feb 09 2006, 06:43AM
Location:
Posts: 1155
A reliable rack server costs a bit more as they use special parity checked dram to detect errors, reliable Intel CPUs, and have several management options most people never encounter. If you already have a SAN, than the incremental cost of adding cores is minimal. There are also special GPU modules for this type of server, but they're not really useful for the task of serving files.
Desktops have some advantages as they are inexpensive if purchased used, and have more space for random parts. The low-end $50 GeForce GTX 295 is a great deal thanks to Microsoft, as they no longer really work for modern games and the linux CUDA developer drivers are mature for these cards. Note, don't bother putting more than 1 GPU card in a machine, and use at least a “750†watt power supply.
Hosts with an older Intel quad run these cards just fine, as do the 24 core >i7 workstations... Ignore modern sleaze-box labels, and use a benchmark cpu list when buying. You will find a discrepancy between value and performance in modern retail outlets.
Registered Member #4266
Joined: Fri Dec 16 2011, 03:15AM
Location:
Posts: 874
Its a chemical finder type program, I have to bruteforce alot of combinations, checked out a desktop motherboard with four sockets that can handle amd opteron 12core, but that and ram would set the price at 6 grand....so still dreaming, that would take 4 hours for 500 chemical combinations, hopefully looking at 100million combinations...
With graphics cards do you get a lower performance with if and for loops? I can unroll some of the for loops. Last time I tryed to write a kernel for a Gcard it wasn't much faster than cpu, could you post or link to a good reference of how to program them.
Registered Member #4266
Joined: Fri Dec 16 2011, 03:15AM
Location:
Posts: 874
There was a 8 core 1 socket for $1000, still haven't ruled out grahics cards, or pics the code could fit on a 32bit pic easly , maybe have a pic with 2gig sdcard with another layer for the second stage processing, $1000 dollars would give 1k cores, do able?
This site is powered by e107, which is released under the GNU GPL License. All work on this site, except where otherwise noted, is licensed under a Creative Commons Attribution-ShareAlike 2.5 License. By submitting any information to this site, you agree that anything submitted will be so licensed. Please read our Disclaimer and Policies page for information on your rights and responsibilities regarding this site.