This site serves as an extension of my brain, where I gather everything related to my favorite topics that I want to remember. Some entries are thoughtful blog posts, while others are simply collections of useful links I’ve come across.
Big Data Tools
11
Spark Streaming SPARK configuration Pyspark streaming from and to csv-file in Zeppelin: basic code example How to print DataFrame column to console Kafka Streaming Json using Gson: How to rename a json field? (Actually this is a Gson topic) Some handy Kafka commands (that I keep forgetting) How to access your clickhouse database with Spark in Python Clickhouse
Some handy Kafka commands (that I keep forgetting) Kafka Streaming Json using Gson: How to rename a json field? (Actually this is a Gson topic) Link to book "Spark The Definite Guide" How to access your clickhouse database with Spark in Python How to print DataFrame column to console SPARK configuration Spark Streaming Clickhouse and Kafka
Machine Learning
3
Math
16
Maximum Likelihood Estimation How to create a random variable with a Beta distribution from scratch, using only Uniform random variables Which envelope should you choose? Variance, Covariance, Autocovariance...what? Stochastic Process and Time Series (this page is not ready yet) Understanding the probability density function of the normal distribution kleiner Gauss Variance
Variance, Covariance, Autocovariance...what? Maximum Likelihood Estimation Jupyter-Notebook: Binominal Distribution Example Jupyter-Notebook: Birthday-Problem Basics Stars and Bars Argument (Einstein-Bose) or placing n identical objects into n distinct bins, or drawing k times from n balls with replacement, order does not matter Joint, Conditional and Marginal Probability Jupyter-Notebook: Confirming frequentists interpretation of conditional dependency
Network Analysis
4
networking ubuntu How to print flow information with nfdump to csv How to collect flows or how to install pmacct with kafka and nDPI support Do network stuff
Programming
8
Difference between function and method Val and def Coin change problem Call by Name vs Call by Value in Scala C Pointer and Reference Useful python helpers Python Snippets Java
Uncategorized
4
How to get a website containig pyscript running on a hosted domain Python Naming Conventions Maximum Likelihood Estimation What I learned from Black Hat Course Applied Data Science for Cyber Security