Hashing is the act of transforming data from arbitrary size to a fixed size value. For example the module operator.
Encode is a way to transform data to another form to preserve usability. For example, 2 bit 0 and 1 can represent almost every kind of data we know.
Encryption is the act of transforming data to another form to preserve confidentiality. For example, AES encryption uses a secret key to transform data into an encrypted string. Only the person who has the same key can decrypt the encrypted string and read the message.
We can always try to guess the original string by calculating the hash of every possible input then comparing the result with the hash string. However, due to hashing collision, there might be more than one input that has the same output.
Symmetric encryption uses the same key for both encrypting and decrypting data.
Asymmetric encryption uses a public key to encrypt and a private key to decrypt data.
Fast Fash is the type of hash function with fast computation time and Slow Hash is the type hash function with slow computation time. Fast Hash is used for checking the integrity of data, while slow hash is used for hashing password.
We use Encode so data can be represented in a better use form. For example, we transform text, audio, image into 1 and 0 so computers can understand, store, process, etc.
A perfect hash function of a set is a hash function that maps distinct elements of this set to elements of another set without any collisions.
When a map initiates, two things need to be considered are the initial capacity and the load factor. The initial capacity is the capacity of the hash table underlying the map. When the load is bigger than the load factor, it means the underlying table is quite full, a process called rehashing happens, a twice bigger size hash table is allocated to store the data of the map.
SSL - Secure Socket Layer, is the original encryption protocol developed for HTTP. It now has been replaced by the TLS, Transport Layer Security, but the name SSL is still widely in use. Whenever a client makes a connection to a server over HTTPS, the client and server will initiate a TLS handshake, which happens after the TCP handshake has been finished.
The steps of a TLS handshake will vary depending on the key exchange algorithm in use. The RSA key exchange algorithm is used most often. The steps are as followed:
- The client kicks off the handshake by sending a "hello" message to the server. The message will include which TLS version, cipher suites the client supports, and a string of random bytes known as the "client random."
- The server sends back a message containing the server's SSL certificate, the server's chosen cipher suite, and the "server random" - another random string of bytes.
- Authentication: The client verifies the server's SSL certificate with the certificate authority that issued it.
- The client sends the "premaster secret", another random string of bytes. The premaster secret is encrypted with the public key from the SSL certificate and can only be decrypted with the private key by the server.
- The server decrypts the premaster secret.
- Session keys created: Both client and server generate session keys from the client random, the server random, and the premaster secret. They should arrive at the same results.
- Client is ready: The client sends a "finished" message that is encrypted with a session key.
- Server is ready: The server sends a "finished" message encrypted with a session key.
- Secure symmetric encryption achieved: The handshake is completed, and communication continues using the session keys.
TLS handshake can also use Diffie-Hellman (DH) as the key exchange algorithm. The steps then will be:
- Client's hello is the same as above.
- Server's hello also sends the server's random, chosen cipher suite, SSL certificate. The server also includes a digital signature, created by encrypting the client random, the server random, and its DH parameter using the private key.
- The client decrypts the server's digital signature using the SSL certificate's public key, verifying that the server controls the private key and is who it says it is. The client then sends its DH parameter to the server.
- The client and server use the exchanged DH parameters to calculate a matching premaster secret independently.
- Session keys created: Now both the client and server calculate the session key using the client random, the server random, and the premaster secret like in an RSA handshake.
The rest is the same as the RSA handshake.
To be defined.
CA stands for certificate authority. It is an entity that verifies the certificate of servers.
When a server and a client want to establish an SSL connection, the server will send its public key to the client. So how the client knows this public key is the right one, and no man in the middle tries to interfere in this conversation. To do that, the server also sent a certificate with the public key. The client will ask the CA to verify if the public key is the correct key of the one in the certificate.
Digital signature is a mathematical scheme to verify the authenticity of data.
HMAC stands for hash based message authentication code, is used to verify both data integrity and data authenticity.
How to store credential information efficiency? (user password, config key, database credential, user information, secret key,.... )
First, we need to know if we need to access the original value or not.
For data like passwords, we don't need to access the raw password of users, so we can use some hashing function to hash the password and store it in the database.
For data like user information, the raw data will need to be retrieved in some cases, so we need to use encryption, instead of hashing. To reduce the risk of data compromise, we need to store data and the encryption key in 2 different places.
Describe a way to defense DDOS? (actually, there are many kinds of DDOS not just network or memory, so this question is pretty complicated)
Block IP address, apply rate limit for IP, for users.