botan.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	Convert Data_Store::Matcher to using lambdas	lloyd	2009-11-16	2	-35/+8
\|
*	Use auto for long iterator names, etc.	lloyd	2009-11-16	2	-24/+2
\| \| \| \|	It will be nice to convert to the range-based for loop once that's available.
*	propagate from branch 'net.randombit.botan' (head ↵	lloyd	2009-11-13	5	-30/+36
\|\ \| \| \| \| \| \| \| \| \| \|	ac888e57b614c623590d79ab615353ad7c76ef68) to branch 'net.randombit.botan.c++0x' (head 9bf78ed7e2521a328f6db7acbc1cd81b07718230)
\| *	Use memcpy for bulk loads if algorithm endianness matches CPU endianess.	lloyd	2009-11-10	1	-0/+9
\| \|
\| *	Also #undef bool after including <altivec.h>	lloyd	2009-11-10	1	-0/+1
\| \|
\| *	Rename CPUID::has_intel_aes to has_aes_intel, and add CPUID::has_aes_via,	lloyd	2009-11-10	1	-2/+11
\| \| \| \| \| \| \| \|	which is currently just a stub returning false.
\| *	Tick to 1.9.3-dev	lloyd	2009-11-06	3	-28/+15
\| \| \| \| \| \| \| \| \| \|	Rename BOTAN_UNALIGNED_LOADSTOR_OK to BOTAN_UNALIGNED_MEMORY_ACCESS_OK which is somewhat more clear as to the point.
* \|	propagate from branch 'net.randombit.botan' (head ↵	lloyd	2009-11-05	4	-4/+22
\|\\| \| \| \| \| \| \| \| \| \| \|	cead7027e70b68a8b4ae2e5bd8f290066e5ea22a) to branch 'net.randombit.botan.c++0x' (head 9edbd485060131b695170f5243a100e06e3b0c71)
\| *	Add an andc operation, in SSE2 and AltiVec, may be useful for Serpent sboxes	lloyd	2009-11-04	4	-4/+22
\| \|
* \|	propagate from branch 'net.randombit.botan' (head ↵	lloyd	2009-11-02	3	-35/+58
\|\ \ \| \|/ \|/\| \| \| \| \| \| \|	2773c2310e8c0a51975987a2dd6c5824c8d43882) to branch 'net.randombit.botan.c++0x' (head f13cf5d7e89706c882604299b508f356c20aae3a)
\| *	Attic-ize all of src/timer, except for time_t_to_tm and system_time	lloyd	2009-10-13	1	-0/+39
\| \| \| \| \| \| \| \|	(which will go later) which will live in the new time.h
\| *	Fixup post-merge breakage	lloyd	2009-10-13	1	-1/+1
\| \|
\| *	propagate from branch 'net.randombit.botan' (head ↵	lloyd	2009-10-13	2	-34/+18
\| \|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	139d6957d20f0b1202e0eacc63cb011588faffde) to branch 'net.randombit.botan.c++0x' (head c16676fa6c393bc3f46a044755ce525a013380a6)
\| \| *	propagate from branch 'net.randombit.botan' (head ↵	lloyd	2009-09-30	2	-34/+18
\| \| \|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	8a5eb02c2e451fc983f234f7ba2f023f5a7d294f) to branch 'net.randombit.botan.c++0x' (head e18cd411269e15638df3298d6a4165446e7ca529)
\| \| \| *	propagate from branch 'net.randombit.botan' (head ↵	lloyd	2009-09-17	6	-110/+70
\| \| \| \|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	92e05ab242e4b6998d685961c53700534a673bce) to branch 'net.randombit.botan.c++0x' (head 27ce37b971ec5cb1f80a9a95b13d5a951b96653b)
\| \| \| * \	propagate from branch 'net.randombit.botan' (head ↵	lloyd	2009-09-08	2	-34/+18
\| \| \| \|\ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	5cadcc57872bef55226579df57349fe09a93d1f5) to branch 'net.randombit.botan.c++0x' (head d1747f0394aa4442e5b32b9102b830e1a86f0e5a)
\| \| \| \| * \	propagate from branch 'net.randombit.botan' (head ↵	lloyd	2009-07-21	11	-959/+24
\| \| \| \| \|\ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	95eb8083f5884531e5ca0667388f8a6fb6d05c41) to branch 'net.randombit.botan.c++0x' (head 56e105e678540c8bcafa4d0198c19a9489fbf8d1)
\| \| \| \| * \ \	propagate from branch 'net.randombit.botan' (head ↵	lloyd	2009-07-15	3	-34/+19
\| \| \| \| \|\ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	5438defd358f82e876917a8bd6d735305ecb0a8e) to branch 'net.randombit.botan.c++0x' (head cbdb2fd418557add29a536f7bdb6e78db16f725c)
\| \| \| \| \| * \ \	propagate from branch 'net.randombit.botan' (head ↵	lloyd	2009-07-03	2	-2/+9
\| \| \| \| \| \|\ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	d6d32791adfa878b6fc0dd3a5b65a665b7bbb549) to branch 'net.randombit.botan.c++0x' (head 54deb0e078aab8cd91c8fd8819d1e6668fc762da)
\| \| \| \| \| * \ \ \	propagate from branch 'net.randombit.botan' (head ↵	lloyd	2009-06-04	5	-91/+19
\| \| \| \| \| \|\ \ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	6a746ccf1e957dba703e65372050a7bd4d6b117d) to branch 'net.randombit.botan.c++0x' (head f54bb7b391eb3b71f380a68ddd460debdc31545d)
\| \| \| \| \| \| * \| \| \|	A few experiments with auto keyword type inference. Looks like things will	lloyd	2009-04-01	1	-18/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	be much cleaner, though I am looking forward to the new for syntax which will simplify a lot of these uses further.
\| \| \| \| \| \| * \| \| \|	Remove copy_if, now included in C++0x (also, it turns out, not being used	lloyd	2009-04-01	1	-16/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	in the source).
\| \| \| \| \| \| * \| \| \|	Remove the mutex classes in favor of C++0x's std::mutex and std::lock_guard	lloyd	2009-04-01	3	-57/+1
\| \| \| \| \| \| \| \| \| \|
* \| \| \| \| \| \| \| \| \|	Slight cleanups in the Altivec detection code for readability.	lloyd	2009-10-29	1	-5/+12
\| \| \| \| \| \| \| \| \| \|
* \| \| \| \| \| \| \| \| \|	Add a new looping load_be / load_le for loading large arrays at once, and	lloyd	2009-10-29	1	-0/+46
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	change some of the hash functions to use it as low hanging fruit. Probably could use further optimization (just unrolls x4 currently), but merely having it as syntax is good as it allows optimizing many functions at once (eg using SSE2 to do 4-way byteswaps).
* \| \| \| \| \| \| \| \| \|	Fix cpuid with icc (tested with 11.1)	lloyd	2009-10-29	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Document SHA optimizations, AltiVec runtime checking, fixes for cpuid for both icc and msvc.
* \| \| \| \| \| \| \| \| \|	Give each version of SIMD_32 a public bswap()	lloyd	2009-10-29	3	-11/+29
\| \| \| \| \| \| \| \| \| \|
* \| \| \| \| \| \| \| \| \|	Add new function enabled() to each of the SIMD_32 instantiations which	lloyd	2009-10-29	3	-1/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	returns true if they might plausibly work. AltiVec and SSE2 versions call into CPUID, scalar version always works.
* \| \| \| \| \| \| \| \| \|	No \|\|= operator!	lloyd	2009-10-29	1	-7/+7
\| \| \| \| \| \| \| \| \| \|
* \| \| \| \| \| \| \| \| \|	Add CPUID::have_altivec for AltiVec runtime detection.	lloyd	2009-10-29	2	-0/+61
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Relies on mfspr emulation/trapping by the kernel, which works on (at least) Linux and NetBSD.
* \| \| \| \| \| \| \| \| \|	Use register writes in the Altivec code for stores because Altivec's handling	lloyd	2009-10-29	1	-7/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	for unaligned writes is messy as hell. If writes are batched this is somewhat easier to deal with (somewhat).
* \| \| \| \| \| \| \| \| \|	Kill realnames on new modules not in mailine	lloyd	2009-10-29	1	-2/+0
\| \| \| \| \| \| \| \| \| \|
* \| \| \| \| \| \| \| \| \|	propagate from branch 'net.randombit.botan' (head ↵	lloyd	2009-10-29	5	-0/+575
\|\ \ \ \ \ \ \ \ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	8fb69dd1c599ada1008c4cab2a6d502cbcc468e0) to branch 'net.randombit.botan.general-simd' (head c05c9a6d398659891fb8cca170ed514ea7e6476d)
\| * \| \| \| \| \| \| \| \| \|	Add copyright + license on the new SIMD files	lloyd	2009-10-28	4	-2/+14
\| \| \| \| \| \| \| \| \| \| \|
\| * \| \| \| \| \| \| \| \| \|	Add an AltiVec SIMD_32 implementation. Tested and works for Serpent and XTEA	lloyd	2009-10-28	1	-0/+178
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	on a PowerPC 970 running Gentoo with GCC 4.3.4 Uses a GCC syntax for creating literal values instead of the Motorola syntax [{1,2,3,4} instead of (1,2,3,4)]. In tests so far, this is much, much slower than either the standard scalar code, or using the SIMD-in-scalar-registers code. It looks like for whatever reason GCC is refusing to inline the function: SIMD_Altivec(__vector unsigned int input) { reg = input; } and calls it with a branch hundreds of times in each function. I don't know if this is the entire reason it's slower, but it definitely can't be helping. The code handles unaligned loads OK but assumes stores are to an aligned address. This will fail drastically some day, and needs to be fixed to either use scalar stores, which (most?) PPCs will handle (if slowly), or batch the loads and stores so we can work across the loads. Considering the code so far loads 4 vectors of data in one go this would probably be a big win (and also for loads, since instead of doing 8 loads for 4 registers only 5 are needed).
\| * \| \| \| \| \| \| \| \| \|	Define SSE rotate_right in terms of rotate left, and load_be in terms	lloyd	2009-10-28	1	-3/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	of load_le + bswap
\| * \| \| \| \| \| \| \| \| \|	Add subtraction operators to SIMD_32 classes, needed for XTEA decrypt	lloyd	2009-10-26	2	-0/+26
\| \| \| \| \| \| \| \| \| \| \|
\| * \| \| \| \| \| \| \| \| \|	Add a wrapper for a set of SSE2 operations with convenient syntax for 4x32	lloyd	2009-10-26	4	-0/+360
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	operations. Also add a pure scalar code version. Convert Serpent to use this new interface, and add an implementation of XTEA in SIMD. The wrappers plus the scalar version allow SIMD-ish code to work on all platforms. This is often a win due to better ILP being visible to the processor (as with the recent XTEA optimizations). Only real danger is register starvation, mostly an issue on x86 these days. So it may (or may not) be a win to consolidate the standard C++ versions and the SIMD versions together. Future work: - Add AltiVec/VMX version - Maybe also for ARM's NEON extension? Less pressing, I would think. - Convert SHA-1 code to use SIMD_32 - Add XTEA SIMD decryption (currently only encrypt) - Change SSE2 engine to SIMD_engine - Modify configure.py to set BOTAN_TARGET_CPU_HAS_[SSE2\|ALTIVEC\|NEON\|XXX] macros
* \| \| \| \| \| \| \| \| \| \|	Remove the 'realname' attribute on all modules and cc/cpu/os info files.	lloyd	2009-10-29	5	-10/+0
\|/ / / / / / / / / / \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Pretty much useless and unused, except for listing the module names in build.h and the short versions totally suffice for that.
* \| \| \| \| \| \| \| \| \|	Add ; after call to VC++'s __cpuid, not a macro	lloyd	2009-10-25	1	-1/+1
\| \| \| \| \| \| \| \| \| \|
* \| \| \| \| \| \| \| \| \|	Cast the u32bit output array to an int* when calling the VC++ intrinsic,	lloyd	2009-10-25	1	-3/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	since it passes signed ints for whatever reason. Ensure CALL_CPUID is always defined (previously, it would not be if on an x86 but compiled with something other than GCC, ICC, VC++).
* \| \| \| \| \| \| \| \| \|	Add new store_[l\|b]e variants taking 8 values.	lloyd	2009-10-23	1	-16/+108
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add new load options that are passed a number of variables by reference, setting them all at once. Will allow for batching operations (eg using SIMD operations to do 128-bit wide bswaps) for future optimizations.
* \| \| \| \| \| \| \| \| \|	Enable CPUID on x86 (checking wrong macro name)	lloyd	2009-10-21	1	-1/+1
\|/ / / / / / / / /
* \| \| \| \| \| \| \| \|	In to_u32bit, ignore space characters in input	lloyd	2009-10-06	1	-0/+3
\| \| \| \| \| \| \| \| \|
* \| \| \| \| \| \| \| \|	Clean up cpuid calling	lloyd	2009-10-06	1	-32/+26
\|/ / / / / / / /
* \| \| \| \| \| \| \|	Disable prefetch in AES for now. Problem: with iterative modes like CBC,	lloyd	2009-09-30	1	-12/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	the prefetch is called for each block of input, and so a total of (4096+256)/64 = 68 prefetches are executed for each block. This reduces performance of iterative modes dramatically. I'm not sure what the right approach for dealing with this is.
* \| \| \| \| \| \| \|	Add cpuid check for Intel AES	lloyd	2009-09-30	1	-1/+8
\| \| \| \| \| \| \| \|
* \| \| \| \| \| \| \|	Add vendor ID for AMD	lloyd	2009-09-29	1	-1/+1
\| \| \| \| \| \| \| \|
* \| \| \| \| \| \| \|	Significantly rework CPUID support. Add cache line detection	lloyd	2009-09-29	2	-87/+99
\| \| \| \| \| \| \| \|
* \| \| \| \| \| \| \|	Change the prefetching interface; move to PREFETCH namespace, and add a	lloyd	2009-09-29	1	-9/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	helper function for fetching both inputs and outputs of block ciphers.