···2233- Gather as much domain knowledge as possible. Technical knowledge is not enough.
44- If it can be solved with SQL, stick to SQL.
55+- To be successful, data engineers need to care about:
66+ - **Fidelity:** how reliably can data be transferred and stored without corruption or loss?
77+ - **Capacity:** how much data can be moved and how quickly?
88+ - **Reliability:** how well can systems recover from outages and incidents?
99+ - **Speed of execution:** how quickly can you get a new data source up and running?
510611## Data Pipelines
712
+1-2
Teamwork.md
···5252 2) Remove more than you add.
5353 3) Optimize what works.
5454 4) Shorten iteration cycles. **[Boyd's Law of Iteration](https://blog.codinghorror.com/boyds-law-of-iteration/): speed of iteration beats quality of iteration**.
5555- 5) [[Automation | Automate]].
5656- 6) Standards, a straight pipe is easier to fix than a curved one
5555+ 5) [[Automation | Automate]] and keep standards.
5756- Keep great global [[coordination]] and incentivize local experimentation.
5857 - Being able to run small and compounding experiments (on the product or company [[processes]] and systems) is important.
5958 - [Some experiments won't work](https://www.lesswrong.com/posts/97LgacucCxmyjYiNT/the-archipelago-model-of-community-standards). But oftentimes it _feels_ like it wont work when in fact you just haven't stuck with it long enough for it to bear fruit. This is hard enough for _solo_ experiments. For group experiments, where not just one but _many_ people must all try a thing at once and _get good at it_, all it takes is a little defection to spiral into a mass exodus.