freevec.org

  • about
  • benchmarks
Home › Software

Search

Primary links

  • About
    • History of libfreevec
  • Benchmarks
    • libfreevec

Please donate to libfreevec to ensure its continuing development! Donations are done via Paypal.





Benchmarking

Comments/Conclusions

markos — Thu, 21/08/2008 - 11:31

There are many comments to be made looking at the results:

  • First, the AltiVec unit is a very powerful SIMD engine which is totally underused throughout the OS(Linux that is) apart from specific applications. In fact, its use in the kernel is strongly discouraged, perhaps rightly perhaps not, that still remains to be proved wrong.

  • It's a fact that the Athlon X2 is a faster CPU than every PowerPC CPU we tested, but still there are plenty of cases where it proved to be slower than its counterparts and that was not always the "fault" of AltiVec and libfreevec.

  • libmotovec behaves strangely and apart from strlen() is slower than libfreevec anyway. Still, it seems to be doing some really clever tricks and I intend to look at it closer in the future.

  • The MPC8610 is a very powerful CPU and especially when we consider its specs (1.3Ghz, 25Wts maximum power draw, WITHOUT the need for a northbridge) we see that the CPU is made to be a winner! Imagine a PowerPC netbook with a 8610, builtin display, fast ram access, very low power consumption. We just have to remember that the Intel Atom, the Via Nano AND the AMD Athlon 64 2000 all require a northbridge CPU which consumes too much power (see here and here). However from our own tests, the MPC8610 developer system consumes just 35Wt total at idle, and 37Wts at full power!!! This is a complete system with many features not required for an end consumer product (eg. a netbook). Coupled with the power of the AltiVec unit, it is just a question of time for someone to produce (again) a consumer PowerPC system using the MPC8610.

  • Finally, with regard to glibc performance, even if we take into account that some common routines are optimised (like strlen(), memcpy(), memcmp() plus some more), most string functions are NOT optimised. Not only that, glibc only includes reference implementations that perform the operations one-byte-at-a-time! How's that for inefficient? We're not talking about dummy unused joke functions here like memfrob(), but really important string and memory functions that are used pretty much everywhere, like strcmp(), strncmp(), strncpy(), etc.

  • In times where power consumption has become so much important, I would think that the first thing to do to save power is optimise the software, and what better place to start than the core parts of an operating system? I can't speak for the kernel -though I'm sure it's very optimised actually- but having looked at the glibc code extensively the past years, I can say that it's grossly unoptimised, so much it hurts.

  • AltiVec
  • Benchmarking
  • libfreevec
  • Login or register to post comments

libfreevec 1.0.4 benchmarks updated!

markos — Thu, 21/08/2008 - 11:23

Hello again,

I managed to find time to update all of the libfreevec benchmarks to the latest version 1.0.4 and also include more complete tests and added a non-ppc architecture (an Athlon X2 5000 @2.6Ghz) where the same tests were run (as 32-bit apps on a 64-bit Linux) for comparison. This is important for two reasons:

  • to find how PowerPC CPUs compare to a current popular x86 CPU (the same benchmarks will be done on an Intel CPU soon)
  • to find any deficiencies in glibc itself (as you will see there are many).

All benchmarks were run on OpenSuse 11.0, except for the G5 which uses Debian Lenny/testing. The compiler used was gcc 4.3.2. All functions have been tested to work correctly on each platform.

  • AltiVec
  • Benchmarking
  • libfreevec
  • Login or register to post comments
Syndicate content

SIMD

  • Algorithms (31)
    • Algebra (9)
      • Matrix operations (8)
    • Bit operations (0)
    • Codecs (0)
      • Audio (0)
      • Video (0)
    • Comparison (0)
      • image comparison (0)
      • Levenshtein (0)
    • Compression (0)
      • Bzip2 (0)
      • Gzip (0)
      • LZMA (0)
      • LZW (0)
      • Squashfs (0)
      • Zlib (0)
    • Encryption (0)
      • AES (0)
      • DES (0)
      • RSA (0)
      • Salsa (0)
      • SSL (0)
    • Hashing (1)
      • CRC (0)
      • TCP/IP checksum (0)
      • UMAC (0)
    • Memory operations (15)
    • Multiprecision (0)
    • Searching (5)
      • String searching (5)
    • Sorting (0)
  • Software (32)
    • Benchmarking (2)
    • Libraries (30)
      • Eigen2 (0)
      • libfreevec (22)
      • simdX86 (8)
  • Architecture (32)
    • AltiVec (32)
    • ARM NEON (0)
    • CELL SPU (0)
    • SSE (0)
    • VIS (0)

User login

  • Create new account
  • Request new password
  • about
  • benchmarks

Copyright (c)2008 by CODEX.
Powered by Drupal. Using theme Deco.
All Google charts have been created by the CSV Chart and Chart API Drupal modules.