Ethernet The Occasional Outsider
Posted by
Zonk
on Thu May 25, 2006 02:18 PM
from the popular-kid-gets-snubbed dept.
from the popular-kid-gets-snubbed dept.
coondoggie writes to mention an article at NetworkWorld about the outsider status of Ethernet in some high-speed data centers. From the article: "The latency of store-and-forward Ethernet technology is imperceptible for most LAN users -- in the low 100-millisec range. But in data centers, where CPUs may be sharing data in memory across different connected machines, the smallest hiccups can fail a process or botch data results. 'When you get into application-layer clustering, milliseconds of latency can have an impact on performance,' Garrison says. This forced many data center network designers to look beyond Ethernet for connectivity options."
This discussion has been archived.
No new comments can be posted.
Ethernet The Occasional Outsider
|
Log In/Create an Account
| Top
| 169 comments
| Search Discussion
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Long Live! (Score:5, Funny)
One Ring to rule them all
Didn't RTFA? -Infiniband, FC and Myrinet beat Eth0 (Score:4, Interesting)
Infiniband http://en.wikipedia.org/wiki/Infiniband [wikipedia.org]
and Myrinet http://en.wikipedia.org/wiki/Myrinet [wikipedia.org]
http://h20311.www2.hp.com/HPC/cache/276360-0-0-0-
HP HPTC site
-What's the speed of dark?
My idea: a vat of salt water & CAT5 (Score:5, Funny)
(Seriously, haven't people heard cut-through switches which just look at the first part of the header and switch based on that... store-and-forward switches are so "1990s")
TDz.
Store & Forward ONLY for 10 to 100 to 1,000. (Score:4, Informative)
#1. You're running different speeds on the same switch (why?).
#2. You really want to cut down on broadcast storms (just fix the real problem, okay?)
Other than that, go for the speed! Full duplex!
For performance, run the same speed. (Score:5, Interesting)
I'm talking performance. Store & Forward hammers your performance. In my experience, you get better performance when you run the server at 100Mb full duplex (along with all the workstations) and use Cut Through than if you have the server on a Gb port, but run Store & Forward to your 100Mb workstations.
30 GB? Take that NSA and your outdated 622MB! (Score:2, Interesting)
(http://www.informationr.us/ | Last Journal: Monday November 05, @09:38AM)
100ms ethernet latency? (Score:5, Informative)
(http://www.federated.com/~jim)
This author does not understand the subject material.
(I suppose you could deliberatly overload a switch enough to get this number, maybe, but that would be silly, and your switch would need 1.25Mbytes of packet cache.)
Re:100ms ethernet latency? (Score:5, Informative)
"By comparison, latency in standard Ethernet gear is measured in milliseconds, or one-millionth of a second, rather than nanoseconds, which are one-billionth of a second"
http://www.google.com/search?hl=en&q=define%3Amil
"One thousandth of a second"
Seriously. How the fuck does this idiot get published?
Low-cost options? (Score:2)
(Last Journal: Monday February 04 2002, @03:31PM)
Not an Auspicious Start (Score:5, Informative)
"(By comparison, latency in standard Ethernet gear is measured in milliseconds, or one-millionth of a second, rather than nanoseconds, which are one-billionth of a second)"
That would be one-thousandth, not millionth (aka micro second). Not a good start...
When you get to many hops (Score:5, Funny)
Software design (Score:3, Interesting)
(http://slashdot.org/)
sharing memory
Sounds like bad design, or a known design trade off.
Quite reasonable, when on a slow link, until I know better assume the data I have is correct, if it isn't throw it out and start over. Not wildly different than branch prediction or other approaches to this type of information.
'When you get into application-layer clustering, milliseconds of latency can have an impact on performance,'
Faster is faster, not really a shocking concept.
Did you mean "microseconds"? (Score:4, Interesting)
(Last Journal: Monday April 03 2006, @07:23PM)
I don't know what sort of switches you use, but my home LAN, with two hops (including one over a wireless bridge) through only slightly-above-lowest-end DLink hardware, I consistantly get under 1ms.
When you get into application-layer clustering, milliseconds of latency can have an impact on performance
Again, I get less than 1ms, singular.
Now, I can appreciate that any latency slows down clustering, but the ranges given just don't make sense. Change that to "microseconds", and it would make more sense. But Ethernet can handle single-digit-ms latencies without breaking a sweat.
Milliseconds? (Score:2, Funny)
Oh, well. People tell me I'm just slow.
sharing memory over ethernet? (Score:1, Flamebait)
(http://www.johnmalone.org)
Maybe I should RTFA...
Lesson: Use appropriate Tech. (Score:2)
But it's never been a really high speed protocol. It's easy to beat, speed-wise, as long as you know what the network use looks like ahead of time.
Which of course is a killer for most general use, but for specialty use that's not so much of a problem.
Store & Forward Unnecessary? (Score:1)
Channel Bonding (Score:1, Offtopic)
Re:Channel Bonding (Score:5, Funny)
No kidding (Score:5, Interesting)
(Last Journal: Tuesday October 30, @04:48AM)
When I was writing applications at the San Diego Supercomputer Center, latency between nodes was the single greatest obstacle to getting your CPUs to running at their full capacity. A CPU waiting to get its data is a useless CPU.
Generally speaking, clusters who want high performance used something like Myrnet instead of ethernet. It's like the difference between consumer, prosumer, and professional products you see in, oh, every industry across the board.
As a side note, how many parallel apps solve the latency issue is by overlapping their communication and computation phases, instead of having them in discrete phases, this can greatly reduce the time a CPU is idle.
The KeLP kernel does overlapping automatically for you if you want: http://www-cse.ucsd.edu/groups/hpcl/scg/kelp.html [ucsd.edu]
OK article, bad title (Score:2)
I guess "Some Tiny Percentage of Data Centers use Something Faster than Ethernet in addition to Ethernet" didn't fit on the page.
Milliseconds? 100's of them? (Score:2)
(http://www.bitsex.net/)
Someone needs to look at their network... (Score:2)
Average latency is around 20ms.
Now I know this isn't as plain as straight ethernet but I'd have guessed the latency if anything on ATM + the change from 802.11g to ethernet to atm to ethernet to whatever would have been worse.
So either someone is using cheep hardware or has misconfigured their network.
Apart from that if I was running a cluster each machine would probably have two NIC's depending on their use - one using gigabit ethernet to provide the internal network between nodes on the cluster and the other for external use. The external network would be as normal, the internal network I'd ensure had minimal routers/switches between the nodes and any switches/routers where a) good quality and b) correctly configured.
Real-World Experience (Score:1)
(http://www.quake4world.com/)
Whoops...
The worst post! (Score:3, Informative)
$ ping google.com
PING google.com (64.233.167.99) 56(84) bytes of data.
64 bytes from 64.233.167.99: icmp_seq=1 ttl=241 time=20.1 ms
64 bytes from 64.233.167.99: icmp_seq=2 ttl=241 time=19.6 ms
64 bytes from 64.233.167.99: icmp_seq=3 ttl=241 time=19.5 ms
What a shame that such a post is on the front page of slashdot! Someone please s/milli/micro.
Slashdot summary wrong, actual article is better (Score:4, Interesting)
Ethernet latency is about 100uS through a gigE switch, round-trip. A full-sized packet takes about 200uS (micro seconds), round-trip. Single-ended latency is about half of that.
There are proprietary technologies that have much faster interconnects, such as the infiniband technology described in the article. But the article also mentions the roadblock that a proprietary technology respresents over a widely-vendored standard. The plain fact of the matter is that ethernet is so ridiculously cheap these days it makes more sense to solve the latency issue in software, for example by designing a better cache coherency management model and by designing better clustered applications, then it does with expensive proprietary hardware.
-Matt
Ethernet Problems, IB problems, etc (Score:2, Interesting)
(http://www.mrjim.org/)
One thing that isn't mentioned in the article is the amount of CPU power required to send out ethernet packets. The typical rule is 1 GHz of processing power is required to send 1 Gb of data on the wire. So, if you want to send 10 Gbs of data, you'd need 10 GHz of processor - pretty steep price. Some companies have managed to get this down to 1 GHz/3 Gbs of processing, and one startup(NetEffect) is now claiming roughly ~0.1 Ghz for ~8 Gbs on the wire, using iWarp. With this, your system can be processing information rather than creating packets.
The problem with Infiniband, Myranet, etc is that they require another card in your system (and associated heat problems, size issues, etc), special switches and equipment, and new training for your staff on how to get it up and going. However, IWarp, which is based on TCP/IP can use your standard DHCP, ping, tracert, ipconfig, etc and can allow a single card to be used for networking to the outside world (TCP/IP), clustering in the datacenter(IWarp), and storage (IScsi). 1 card, no special new software widgets, 10 Gb speeds.
However, you cant go and buy a iWarp card from Fry's today. Although, you cant buy an infiniband or myranet card there either
Tolkien ring (Score:4, Funny)
Well yeah... (Score:2)
Of course, on a large scale network, it is much simpler and easier to do switch-routing frames, but for tightly controlled networks, source-routed can be very advantageous.
I will say switch routed frames have the *potential* for much better utilization of multi-port aggregations, but largely the member of a multi-port aggregation used to send a packet is not based on port congestion, but rather on a hash of the MAC address referenced in the packet, which is nothing a source routed network couldn't do.
Ugh, they must have read an old paper (Score:1)
(http://schluting.com/)
For those who don't understand... (Score:4, Informative)
If you go to a high-speed network, what you get is a packet being forwarded as it's being read. By the time the first few bits are through the switch, it should be able to figure out the next hop and have the packet moving in that direction. Phone companies have huge problems with the delays in Ethernet. This is why the ATM protocol was invented, it's hard to use, awkward and not too graceful, but it can fly through a switching network like nobody's business.
Ethernet is also extremely sloppy--Any switch along the way is allowed to throw a packet away and wait for the originator to resend causing a HUGE hiccupp in the communication stream (Most if not all routers do this whenever an address is not in it's forwarding table yet).
IIRC the faster protocols see a "Routing" packet in the stream and set up forwarding hardware before getting the actual packet/stream, then wait until the end of the packet (or entire stream) to tear the route down again.
Ethernet, however, due to it's simplicity is bridging the gaps. It's a pretty crappy protocol in general, but we keep throwing better, smarter hardware at it in an effort to brute-force it into the parameters we require. (I work for a company that makes Ethernet over fiber hardware, and have worked for companies based around ATM, SONET and other interesting solutions).
I guess the point of the article was to remind a world that is coming to believe that ethernet is the end-all be-all of networking that it was always just the simplest hack available and therefore the easiest to deal with.
Just like SNMP.
Not so sure (Score:2)
But here's where I notice some performance. We've got all the servers on a gigabit vlan. I can shift a 300MB file between servers in under 20 seconds. Transitioning a 5MB link takes five minutes.
So we did what we could to eliminate latency and we see it in the performance of our network.
Could somebody clue me in? (Score:1)
"But in data centers, where CPUs may be sharing data in memory across different connected machines..."
I have re-read this bit like twenty times and still have no idea what it means. The terms used clashes badly, which leads me to believe that the guy has no idea what he was talking about
The didn't do enough research on the topic (Score:2)
(http://www.sushifaq.com/)
Blue Gene (Score:1)
Any ideas why Ethernet is not an outsider here.
* lon3st4r *