Distributed System

Main focus when designing

Fault Tolerance
- Availibility: Can be achieved using replication
- Recoverability: Can be achieved using logging/transaction, durable storage etc.
Consistency

Does get() returns latest put()?
- Strong consistency
- Weak consistency
- Eventual Consistency
Performance
- Scalable throughput
- Lower latency

https://pdos.csail.mit.edu/6.824/papers/mapreduce.pdf

Fault Tolernace aspects

Co-ordinator re-runs map/redue function if worker node fails
Map/Reduce functions are functional/determenistic and can be run more than once
Other fail points
- Can co-ordinator fails? NO
- Slow workers(Stagglers)? Co-ordinator can run backup tasks and assigned to other worker node

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
src		src
.check-build		.check-build
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md