botan.git

	Commit message	Author	Age	Files	Lines
*	In all cases where the block size of the cipher is fixed, the key	lloyd	2010-10-14	30	-137/+176
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	parameters are as well. So make them template parameters. The sole exception was AES, because you could either initialize AES with a fixed key length, in which case it would only be that specific key length, or not, in which case it would support any valid AES key size. This is removed in this checkin; you have to specifically ask for AES-128, AES-192, or AES-256, depending on which one you want. This is probably actually a good thing, because every implementation other than the base one (SSSE3, AES-NI, OpenSSL) did not support "AES", only the versions with specific fixed key sizes. So forcing the user to ask for the one they want ensures they get the ones that are faster and/or safer.
*	More size_t	lloyd	2010-10-13	3	-18/+18
\|
*	Implicit rounds based on key variable size	lloyd	2010-10-13	4	-27/+44
\|
*	Make the rounds implicit with the size of the key values	lloyd	2010-10-13	2	-55/+42
\|
*	More size_t. Document changes	lloyd	2010-10-13	3	-8/+8
\|
*	Add a new subclass for BlockCipher BlockCipher_Fixed_Block_Size, which	lloyd	2010-10-13	57	-203/+232
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	sets the block size statically and also creates an enum with the size. Use the enum instead of calling block_size() where possible, since that uses two virtual function calls per block which is quite unfortunate. The real advantages here as compared to the previous version which kept the block size as a per-object u32bit: - The compiler can inline the constant as an immediate operand (previously it would load the value via an indirection on this) - Removes 32 bits per object overhead (except in cases with actually variable block sizes, which are very few and rarely used)
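The pattern described in this commit can be sketched roughly as follows. This is a minimal illustration, not Botan's actual declarations: the simplified `BlockCipher` base and the `Toy_Cipher` class (with its XOR "encryption") are hypothetical stand-ins, and only the names `BlockCipher_Fixed_Block_Size`, `block_size()` and `BLOCK_SIZE` come from the log.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical, simplified base class; the real interface is larger.
class BlockCipher
{
public:
   virtual ~BlockCipher() {}
   virtual std::size_t block_size() const = 0;
};

// The block size is a template parameter, so it costs no per-object
// storage and is also exposed as a compile-time enum.
template<std::size_t BS>
class BlockCipher_Fixed_Block_Size : public BlockCipher
{
public:
   enum { BLOCK_SIZE = BS };
   std::size_t block_size() const override { return BS; }
};

// Toy cipher (assumption, for illustration only): XORs every byte with 0x5A.
class Toy_Cipher : public BlockCipher_Fixed_Block_Size<8>
{
public:
   void encrypt_n(const unsigned char in[], unsigned char out[],
                  std::size_t blocks) const
   {
      // BLOCK_SIZE is a constant here, so no virtual call is needed
      // and the compiler can fold it into the loop bound.
      for(std::size_t i = 0; i != blocks * BLOCK_SIZE; ++i)
         out[i] = in[i] ^ 0x5A;
   }
};
```

Code that only holds a `BlockCipher` reference still calls the virtual `block_size()`, while code inside a concrete cipher can use the enum directly.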
*	s/BLOCK_SIZE/block_size()/	lloyd	2010-10-13	30	-143/+143
\|
*	Use output_length() instead of OUTPUT_LENGTH pseudo-property	lloyd	2010-10-13	2	-6/+6
\|
*	Add accessors for block size and output length	lloyd	2010-10-13	1	-0/+5
\|
*	Use size_t rather than u32bit in SymmetricAlgorithm	lloyd	2010-10-13	58	-132/+131
\|
*	size_t	lloyd	2010-10-13	1	-1/+1
\|
*	More size_t	lloyd	2010-10-13	4	-25/+25
\|
*	Use size_t rather than u32bit for the blocks argument of encrypt_n	lloyd	2010-10-12	63	-396/+422
\|
*	s/u32bit/size_t/ for block cipher parallelism queries	lloyd	2010-10-12	6	-9/+9
\|
*	Compile fix	lloyd	2010-10-04	1	-1/+1
\|
*	Make configure output more sensible wrt incompatible modules	lloyd	2010-09-28	1	-0/+4
\|
*	In the IDEA key schedule, using the extended Euclidean algorithm to	lloyd	2010-09-23	1	-17/+14
\| \| \| \| \| \|	compute the inverses mod 65537 exposed a timing vulnerability. Avoid this by instead using exponentiation, which takes constant time (up to variability in the multiplication operation, at least).
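As a rough sketch of the exponentiation approach: 65537 is prime, so by Fermat's little theorem the inverse of x is x^65535 mod 65537, and because the exponent 65535 is all ones in binary, square-and-multiply performs the same operations for every input. The function names below are hypothetical, and the simple `%`-based multiply is an assumption made for clarity; it is not claimed to be constant time on every platform.

```cpp
#include <cassert>
#include <cstdint>

// Multiplication modulo 65537 using IDEA's convention that the 16-bit
// value 0 represents 65536.
inline std::uint16_t idea_mul(std::uint16_t a, std::uint16_t b)
{
   const std::uint64_t x = (a == 0) ? 65536u : a;
   const std::uint64_t y = (b == 0) ? 65536u : b;
   const std::uint64_t p = (x * y) % 65537u;
   return static_cast<std::uint16_t>(p); // 65536 truncates to 0
}

// Inverse via x^65535. Starting from exponent 1, each iteration maps
// the exponent e to 2e + 1, so 15 iterations reach 2^16 - 1 = 65535.
// The loop count and operation sequence do not depend on x.
inline std::uint16_t idea_mul_inv(std::uint16_t x)
{
   std::uint16_t y = x;
   for(int i = 0; i != 15; ++i)
   {
      y = idea_mul(y, y); // square
      y = idea_mul(y, x); // multiply
   }
   return y;
}
```

For example, `idea_mul_inv(2)` yields 32769, since 2 * 32769 = 65538, which is 1 mod 65537.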
*	Remove more implicit vector to pointer conversions	lloyd	2010-09-14	1	-10/+12
\|
*	Handle the case that container size() returns something other than u32bit	lloyd	2010-09-14	2	-2/+3
\|
*	Completely remove the second parameter to SecureVector which specifies	lloyd	2010-09-14	40	-185/+202
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	the initial/default length of the array, update all users to instead pass the value to the constructor. This is an old vestigial thing from a class (SecureBuffer) that used this compile-time constant in order to store the values in an array. However this was changed way back in 2002 to use the same allocator hooks as the rest of the containers, so the only advantage to using the length field was that the initial length was set and didn't have to be set in the constructor, which was mildly convenient. However this directly conflicts with the desire to be able to (eventually) use std::vector with a custom allocator, since of course vector doesn't support this. Fortunately almost all of the uses are in classes which have only a single constructor, so there is little to no duplication by instead initializing the size in the constructor.
*	Remove more uses of vector to pointer implicit conversions	lloyd	2010-09-13	2	-6/+11
\|
*	First set of changes for avoiding use of implicit vector->pointer conversions	lloyd	2010-09-13	19	-110/+129
\|
*	Anywhere where we use MemoryRegion::begin to get access to the raw pointer	lloyd	2010-09-13	2	-6/+6
\| \| \| \| \|	representation (rather than in an iterator context), instead use &buf[0], which works for both MemoryRegion and std::vector
*	Big, invasive but mostly automated change, with a further attempt at	lloyd	2010-09-07	26	-47/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	harmonising MemoryRegion with std::vector: The MemoryRegion::clear() function would zeroise the buffer, but keep the memory allocated and the size unchanged. This is very different from STL's clear(), which is basically the equivalent to what is called destroy() in MemoryRegion. So to be able to replace MemoryRegion with a std::vector, we have to rename destroy() to clear() and we have to expose the current functionality of clear() in some other way, since vector doesn't support this operation. Do so by adding a global function named zeroise() which takes a MemoryRegion which is zeroed. Remove clear() to ensure all callers are updated.
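The difference between the two operations can be illustrated with `std::vector` standing in for `MemoryRegion`. This is only a sketch under that assumption: the `zeroise()` below takes a `std::vector` rather than a `MemoryRegion`, and a production version meant for wiping secrets would also have to stop the compiler from optimising the writes away, which plain `std::fill` does not guarantee.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Free function in the spirit of the commit: overwrite every element
// with zero but leave the size and the allocation untouched.
template<typename T>
void zeroise(std::vector<T>& buf)
{
   std::fill(buf.begin(), buf.end(), T());
}
```

After `zeroise(v)` the vector still has its original size with every element zero, whereas `v.clear()` follows STL semantics and leaves it empty.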
*	Prevent shadowing of one loop param with another	lloyd	2010-09-07	1	-1/+1
\|
*	The SSSE3 intrinsics apparently work under Sun Studio as well	lloyd	2010-09-07	1	-0/+1
\|
*	Fix paper ref URL, remove unused prefetch include	lloyd	2010-08-20	1	-5/+9
\|
*	Also use a smaller table in the first round of AES in the decrypt direction	lloyd	2010-08-19	1	-9/+19
\|
*	In the first round of AES, use a 256 element table and do the	lloyd	2010-08-18	1	-9/+28
\| \| \| \| \| \| \| \| \| \| \| \| \|	rotations in the code. This reduces the number of cache lines potentially accessed in the first round from 64 to 16 (assuming 64 byte cache lines). On average, about 10 cache lines will actually be accessed, assuming a uniform distribution of the inputs, so there definitely is still a timing channel here, just a somewhat smaller one. I experimented with using the 256 element table for all rounds but it reduced performance significantly and I'm not sure if the benefit is worth the cost or not.
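The cache-line figures in this message can be checked with a short calculation. The sketch below assumes the usual layout of four tables of 256 32-bit words for the larger case, and 16 table lookups in the first round (one per state byte); neither assumption is stated in the log itself, and the helper names are hypothetical.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>

// Number of cache lines spanned by a set of equally sized tables.
inline std::size_t cache_lines(std::size_t tables, std::size_t entries,
                               std::size_t entry_bytes, std::size_t line_bytes)
{
   return tables * entries * entry_bytes / line_bytes;
}

// Expected number of distinct lines touched by `lookups` independent,
// uniformly distributed accesses over `lines` lines:
//   lines * (1 - (1 - 1/lines)^lookups)
inline double expected_lines_touched(std::size_t lines, std::size_t lookups)
{
   const double n = static_cast<double>(lines);
   return n * (1.0 - std::pow(1.0 - 1.0 / n, static_cast<double>(lookups)));
}
```

With 64-byte lines, `cache_lines(4, 256, 4, 64)` gives 64 and `cache_lines(1, 256, 4, 64)` gives 16, while `expected_lines_touched(16, 16)` comes out at roughly 10.3, in line with the "about 10" quoted above.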
*	Add also AES-192 using SSSE3	lloyd	2010-08-12	2	-23/+149
\|
*	Support AES-256 in the SSSE3 implementation	lloyd	2010-08-12	2	-5/+93
\|
*	Use _mm_set_epi32 instead of _mm_set_epi64x - VC++ obnoxiously only	lloyd	2010-08-11	2	-79/+79
\| \| \| \|	supports epi64x in 64-bit mode.
*	Only enable aes_ssse3 when compiling with GCC or Clang. For some dumbass	lloyd	2010-08-09	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	reasons, Intel C++ rejects const __m128i foo = _mm_set_epi64x(...) though it will accept if you use one of the _mm_set1 variants. And Visual C++ doesn't know about _mm_set_epi64x() in 32-bit mode for similarly dumb reasons - it works fine compiling for 64 bit but for whatever reason they don't offer this function when compiling as 32 bit. Unfortunately there isn't a good way to specify it's OK with a particular compiler with one arch but not another, so just disable it globally for the time being. The workaround for VC++ is probably to use _mm_set_epi32 and break up the input values into 32 bit chunks. ICC is a lost cause I fear.
*	Add an implementation of AES-128 using SSSE3 instructions. It runs in	lloyd	2010-08-09	3	-0/+454
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	constant time and on a Nehalem is significantly faster than the table based version. This implementation technique was invented by Mike Hamburg and described in a paper in CHES 2009 "Accelerating AES with Vector Permute Instructions". This code is basically a translation of his public domain x86-64 assembly code into intrinsics. Todo: Adding support for AES-192 and AES-256; this just requires implementing the key schedules. Currently only tested on an i7 with GCC (32 and 64 bit code); testing/optimization on 32-bit processors with SSSE3 like the Atom, and with Visual C++ and other compilers, are also todos.
*	Also allow clang with 32-bit assembly code, everything seems to work	lloyd	2010-08-08	1	-19/+0
\| \| \| \|	fine with latest SVN.
*	Only call the scalar versions if we actually have leftover blocks to	lloyd	2010-06-22	4	-8/+16
\| \| \| \|	process
*	Doxygen	lloyd	2010-06-21	1	-3/+26
\|
*	In IDEA, Noekeon, Serpent, XTEA, provide and use ro accessor functions	lloyd	2010-06-21	8	-15/+43
\| \| \| \| \|	for getting access to the key schedule, instead of giving the key schedule protected status, which is much harder to audit.
*	Make Serpent's key_schedule and actual round keys private. Add	lloyd	2010-06-21	1	-1/+15
\| \| \| \| \|	protected accessor functions for get and set. Set is needed by the x86 version since it implements the key schedule directly.
*	Replace "@return a blah" and "@return the blah" with just "@return blah"	lloyd	2010-06-16	2	-3/+3
\|
*	Yet more Doxygen comments	lloyd	2010-06-16	4	-19/+28
\|
*	More Doxygen	lloyd	2010-06-15	4	-3/+27
\|
*	Don't allow access to key_schedule, just the members	lloyd	2010-06-15	2	-3/+5
\|
*	More Doxygen updates	lloyd	2010-06-15	6	-13/+42
\|
*	More Doxygen fixes	lloyd	2010-06-15	28	-40/+48
\|
*	Fix a few hundred Doxygen warnings	lloyd	2010-06-15	1	-2/+2
\|
*	Fix build	lloyd	2010-06-07	1	-0/+1
\|
*	Use "/*" instead of "/**" in starting comments at the beginning of a file.	lloyd	2010-06-07	7	-7/+6
\| \| \| \| \|	This caused Doxygen to think this was markup meant for it, which really caused some clutter in the namespace page.
*	Hide --enable-isa and instead expose --enable-{sse2,ssse3,aes-ni,altivec}	lloyd	2010-05-26	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	in the help. Unfortunately we can't just remove --enable-isa, because for the callback to work the target list has to already exist, and it only does by virtue of the default=[] param to the enable-isa setup. We could just use append_const, except then we can't run on Python 2.4, and the latest release of RHEL only has 2.4 :( Also rename aes_ni to aes-ni in configuration-speak.
*	Change BlockCipher::parallelism() to return the native parallelism of	lloyd	2010-05-25	6	-9/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	the implementation rather than the preferred one. Update all implementations. Add a new function parallel_bytes() which returns parallelism() * BLOCK_SIZE * BUILD_TIME_CONSTANT. This is because I noticed all current calls of parallelism() just multiplied the result by the block size already, so this simplified that code. The build time constant is set to 4, which was the previous default return value of parallelism(). However the SIMD versions returned 2*native parallelism rather than 4, so this increases the buffer sizes used for those algorithms. The constant multiple lives in buildh.in and build.h, and is named BOTAN_BLOCK_CIPHER_PAR_MULT.
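The buffer-size arithmetic from this message, as a small sketch. `Cipher_Shape` and the local constant are hypothetical stand-ins for the real class members and for `BOTAN_BLOCK_CIPHER_PAR_MULT`; the SIMD figures in the usage note assume an implementation with a native parallelism of 4 blocks, which the log does not specify.

```cpp
#include <cassert>
#include <cstddef>

// Stand-in for BOTAN_BLOCK_CIPHER_PAR_MULT, which the log says is 4.
const std::size_t BLOCK_CIPHER_PAR_MULT = 4;

// Hypothetical description of a cipher implementation.
struct Cipher_Shape
{
   std::size_t block_size;  // bytes per block
   std::size_t parallelism; // blocks processed natively at once
};

// parallelism() * BLOCK_SIZE * build-time constant, as in the message.
inline std::size_t parallel_bytes(const Cipher_Shape& c)
{
   return c.parallelism * c.block_size * BLOCK_CIPHER_PAR_MULT;
}
```

A scalar cipher with 16-byte blocks and parallelism 1 gets 64 bytes, the same as before the change. Under the stated assumption, a SIMD version with native parallelism 4 now gets 256 bytes, where the old scheme of returning 2*native would have given 8 * 16 = 128.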