botan.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	Almost double the speed of MARS; from 55 MiB/s to 102 on my Core2.	lloyd	2009-11-11	3	-231/+216
\|
*	Slightly cleaner SHA-256 F1 func; ~1% faster	lloyd	2009-11-10	1	-3/+3
\|
*	Use memcpy for bulk loads if algorithm endianness matches CPU endianess.	lloyd	2009-11-10	1	-0/+9
\|
*	Remove SSE4 dependency in AES-192 key schedule, and also avoid requiring	lloyd	2009-11-10	2	-26/+25
\| \| \| \|	an extra 4 words at the end of EK for writing (unused) values.
*	Add AES-192 using AES-NI. Tested OK with Intel's simulator.	lloyd	2009-11-10	4	-10/+277
\| \| \| \| \| \| \|	Currently requires SSE4.1 for _mm_extract_epi32 for the key schedule, it would be nice to remove this dependency, though all currently known/scheduled chips with AES-NI (Intel Westmere and Sandy Bridge, and AMD Bulldozer) are supposed to include SSE 4.1 so this is not a huge problem.
*	Also #undef bool after including <altivec.h>	lloyd	2009-11-10	1	-0/+1
\|
*	Clean up cpuid test prog	lloyd	2009-11-10	1	-11/+18
\|
*	Rename CPUID::has_intel_aes to has_aes_intel, and add CPUID::has_aes_via,	lloyd	2009-11-10	3	-5/+17
\| \| \| \|	which is currently just a stub returning false.
*	Add unrolled versions of AES-NI code that will handle 4 blocks in parallel.	lloyd	2009-11-10	1	-12/+176
\| \| \| \| \|	No noticable change under the simulator (no surprises there), but should help a lot with pipelining on real hardware.
*	Fix errors in the AES-256 key schedule for the AES-NI version. Now passes	lloyd	2009-11-10	4	-198/+171
\| \| \| \| \| \| \| \| \|	tests under Intel's emulator. Document and enable in the engine. Merge both versions to aes_intel.cpp - some shared code and much similiar structure which might be sharable via macros.
*	Add AES-256 using AES-NI	lloyd	2009-11-10	3	-3/+243
\|
*	Make the AES implementation using Intel's AES instruction extension official;	lloyd	2009-11-10	4	-7/+9
\| \| \| \|	testing with Intel's emulator shows all green.
*	Split the AES vectors into 3 specifically named AES-128, AES-192, and	lloyd	2009-11-10	1	-1651/+1650
\| \| \| \| \| \|	AES-256 blocks, plus a handful remaining in a general AES block. This is necessary for any implementation which only supports a particular key size, since otherwise no tests at all will run on that implementation.
*	Add Nehalem/Westmere tags for ICC	lloyd	2009-11-10	1	-3/+7
\|
*	Make set_all_values in {ECDSA,ECKAEG}_{Public,Private}Key all non-virtual;	lloyd	2009-11-10	2	-6/+6
\| \| \| \| \| \| \|	virtual-ness not needed, and was overriding/overloading by argument which doesn't actually work in C++ and only happened to work because it was only ever used with the version implemented in that same class. ICC was warning, too. Make non-virtual.
*	Cleanups - remove emails from source files, they should only live in	lloyd	2009-11-10	19	-62/+39
\| \| \| \|	credits.txt and thanks.txt. Remove some various bits of formatting weirdness.
*	Remove my email address from the copyright headers in the tss files, not	lloyd	2009-11-10	2	-2/+2
\| \| \| \| \| \|	included elsewhere and my preference is for the only emails to be in credits.txt since emails change more often than names and I'd prefer them not to be constantly either wrong or needing updates.
*	In creating X.509 certificates and PKCS #10 requests, let (actually: require)	lloyd	2009-11-09	10	-39/+91
\| \| \| \| \| \| \|	the user to specify the hash function to use, instead of always using SHA-1. This was a sensible default a few years ago, when there wasn't a ~2^60 attack on SHA-1 and support for SHA-2 was pretty much nil, but using something else makes a lot more sense these days.
*	Clean up aes_128_key_expansion	lloyd	2009-11-06	1	-24/+18
\|
*	Respect --with-isa when choosing what to enable	lloyd	2009-11-06	1	-3/+4
\|
*	GCC doesn't know what Nehalem or Westmere are, though it does know about	lloyd	2009-11-06	1	-0/+3
\| \| \| \| \|	the AES and PCLMUL instructions. Oddness. For the time being, compile Nehalem and Westmere as Core2 + extras, probably close enough.
*	Dename unused length field	lloyd	2009-11-06	1	-1/+1
\|
*	Add a new need_isa marker for info.txt that lets a module depend	lloyd	2009-11-06	6	-25/+31
\| \| \| \| \| \| \| \| \| \| \| \|	on a particular ISA extension rather than a list of CPUs. Much easier to edit and audit, too. Add markers on the AES-NI code and SHA-1/SSE2. Serpent and XTEA don't need it because they are generic and only depend on simd_32 which will silenty swap out a scalar version if SSE2/AltiVec isn't enabled (since it turns out on supersclar processors just doing 4 blocks in parallel can be a win even in GPRs). Add pentium3 to the list of CPUs with rdtsc, was missing. Odd!
*	Add a complete but untested AES-128 using the AES-NI intrinsics.	lloyd	2009-11-06	3	-68/+147
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	From looking at how key gen works in particular, it seems easiest to provide only AES-128, AES-192, and AES-256 and not a general AES class that can accept any key length. This also has the bonus of allowing full loop unrolling which may be a win (how much so will depend on the latency/throughput of the AES instructions which is currently unknown). No block interleaving, though of course it works very nicely here, simply due to the desire to keep things simple until what is currently here can actually be tested. (Intel has an emulator that is supposed to work but just crashes on my machine...) I'm not entirely sure if byte swapping is required. Intel has a white paper out that suggests it isn't (and really it would have been stupid of them to not build this into the aes instructions), but who knows. If it turns out to be necessary there is a pretty fast bswap instruction for SSE anyway.
*	Stub for AES class using Intel's AES-NI instructions and an engine for	lloyd	2009-11-06	7	-0/+238
\| \| \| \| \|	providing it. Also stubs in the engine for VIA's AES instructions, but needs CPUID checking also.
*	The default_submodel option was used by configure.pl but configure.py	lloyd	2009-11-06	17	-39/+8
\| \| \| \| \| \|	ignores this unless it can detect (or is asked to use) a specific model; otherwise it compiles for the baseline ISA. Remove the default_submodel entries in the arch files.
*	The code for handling SIMD ISA extensions actually works fine for general	lloyd	2009-11-06	6	-35/+44
\| \| \| \| \| \| \| \|	ISA extensions (say, Intel's AES-NI, for instance) so change everything to reflect that. Also rename some of the amd64 models, and add entries for k10, nehalem, and westmere processors.
*	Make it possible to explicitly enable SIMD extensions.	lloyd	2009-11-06	1	-19/+28
\| \| \| \| \| \| \| \| \|	There is no point, as far as I can see, of being able to explicitly disable a SIMD or other ISA extension, because if you are compiling for that particular CPU the compiler might well choose to insert CPU-specific instructions anyway. For instance if one is compiling on a P4 but wants to disable SSE2, the right thing to do is compile for (say) an i686 which ensures that no P4 instructions will be emitted.
*	Tick to 1.9.3-dev	lloyd	2009-11-06	6	-37/+27
\| \| \| \| \|	Rename BOTAN_UNALIGNED_LOADSTOR_OK to BOTAN_UNALIGNED_MEMORY_ACCESS_OK which is somewhat more clear as to the point.
*	Generate SIMD macro flags for build.h from data in build-data/arch for	lloyd	2009-11-06	6	-6/+70
\| \| \| \| \| \|	SSE2, SSSE3, NEON, and AltiVec. Add entries for Intel Atom, POWER6 and POWER7, and the Cortex A8 and A9.
*	Add an andc operation, in SSE2 and AltiVec, may be useful for Serpent sboxes	lloyd	2009-11-04	4	-4/+22
\|
*	Set BOTAN_TARGET_CPU_HAS_SSE2 macro if amd64. Not set at all for any 32-bit	lloyd	2009-11-04	1	-0/+3
\| \| \| \| \|	x86 currently. This should be fixed. But it's an improvement over having to always set it manually, at least.
*	Indent and avoid one extra assignment	lloyd	2009-11-04	1	-3/+2
\|
*	propagate from branch 'net.randombit.botan.1_8' (head ↵	lloyd	2009-11-03	559	-6939/+13364
\|\ \| \| \| \| \| \| \| \| \| \|	6e8c18515725a70923b34118951252723dd4c29a) to branch 'net.randombit.botan' (head 77ba4ea5a4be36d6d029bcc852b2271edff0d679)
\| *	propagate from branch 'net.randombit.botan.1_8' (head ↵1.9.2	lloyd	2009-11-03	2	-2/+3
\| \|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	a101c8c86b755a666c72baf03154230e09e0667e) to branch 'net.randombit.botan' (head 948905e3872b6f5904686533c6aa87d38ff90a71)
\| * \|	Update for 1.9.2 release 2009-11-03	lloyd	2009-11-03	4	-11/+5
\| \| \|
\| * \|	Conver the rest of the hash functions to use the array-based load instructions.	lloyd	2009-11-03	5	-40/+41
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I'm not totally happy with this - in particular in all cases the size is a compile time constant - it would be nice to make use of this via tempalate metaprogramming. Also for matching endian loads, a straight memcpy would do the work, which would probably be even faster.
\| * \|	Slight cleanups in the Altivec detection code for readability.	lloyd	2009-10-29	1	-5/+12
\| \| \|
\| * \|	Add a new looping load_be / load_le for loading large arrays at once, and	lloyd	2009-10-29	11	-49/+104
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	change some of the hash functions to use it as low hanging fruit. Probably could use further optimization (just unrolls x4 currently), but merely having it as syntax is good as it allows optimizing many functions at once (eg using SSE2 to do 4-way byteswaps).
\| * \|	Fix cpuid with icc (tested with 11.1)	lloyd	2009-10-29	2	-2/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Document SHA optimizations, AltiVec runtime checking, fixes for cpuid for both icc and msvc.
\| * \|	propagate from branch 'net.randombit.botan' (head ↵	lloyd	2009-10-29	30	-964/+1723
\| \|\ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	4fd7eb9630271d3c1dfed21987ef864680d4ce7b) to branch 'net.randombit.botan.general-simd' (head 91df868149cdc4754d340e6103028acc82182609)
\| \| * \|	Clean up prep00_15 - same speed on Core2	lloyd	2009-10-29	1	-16/+10
\| \| \| \|
\| \| * \|	Clean up the SSE2 SHA-1 code quite a bit, make better use of C++ features	lloyd	2009-10-29	2	-308/+267
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	and also make it stylistically much closer to the standard SHA-1 code.
\| \| * \|	Format for easier reading	lloyd	2009-10-29	1	-31/+43
\| \| \| \|
\| \| * \|	Small cleanups (remove tab characters, change macros to fit the rest of	lloyd	2009-10-29	1	-123/+121
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	the code stylistically, etc)
\| \| * \|	Give each version of SIMD_32 a public bswap()	lloyd	2009-10-29	3	-11/+29
\| \| \| \|
\| \| * \|	Add new function enabled() to each of the SIMD_32 instantiations which	lloyd	2009-10-29	3	-1/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	returns true if they might plausibly work. AltiVec and SSE2 versions call into CPUID, scalar version always works.
\| \| * \|	No \|\|= operator!	lloyd	2009-10-29	1	-7/+7
\| \| \| \|
\| \| * \|	Add CPUID::have_altivec for AltiVec runtime detection.	lloyd	2009-10-29	3	-0/+63
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Relies on mfspr emulation/trapping by the kernel, which works on (at least) Linux and NetBSD.
\| \| * \|	Rename sse2 engine to simd	lloyd	2009-10-29	2	-2/+2
\| \| \| \|