1
- Why Free and Open Machine Learning
2
- ==================================
1
+ # Why Free and Open Machine Learning
2
+
3
3
4
4
Free and Open machine learning is comparable with open source software
5
5
(FOSS - Free and Open Source Software). But openness for machine
@@ -51,8 +51,8 @@ These aspects are the core pillars of Free and Open Machine Learning.
51
51
52
52
![ Pillars of FOSS Machine Learning] ( /images/foss-ml.png )
53
53
54
- Open Source (FOSS)
55
- ------------------
54
+ ## Open Source (FOSS)
55
+
56
56
57
57
Free and open-source software (FOSS) is software that can be classified
58
58
as both free software and open-source software. FOSS is an inclusive
@@ -169,7 +169,7 @@ and data driven are used to sell you old solutions using this new trend.
169
169
You can easily be fooled since massive marketing efforts (time, money,
170
170
material) are invested to sell old buggy solutions as new innovative
171
171
machine learning powered solutions. In reality black box solutions from
172
- small or large vendors that seems good to be true for your use case, are
172
+ small or large vendors that seems too good to be true for your use case, are
173
173
almost always based on fads. This is why you should be very suspicious
174
174
when using cloud based machine offerings that offers you instant new
175
175
business and customers. Make sure to do a fast and cheap hands on
@@ -188,8 +188,8 @@ you can ask every IT company or consultant with the right skills to
188
188
audit the application. Because in the end: When security, safety or
189
189
privacy of your customers is at risk, you are accountable.
190
190
191
- Open data
192
- ---------
191
+ ## Open data
192
+
193
193
194
194
Free and Open machine learning does not only need FOSS software, but
195
195
also open data sets. Data is one of the most important aspects for
@@ -263,8 +263,8 @@ principles are:
263
263
- Be transparent.
264
264
- Respect data privacy regulations and laws (e.g. EU GDPR)
265
265
266
- Open Science and open algorithms
267
- --------------------------------
266
+ ## Open Science and open algorithms
267
+
268
268
269
269
Machine learning is a challenging science. Many researchers on
270
270
universities worldwide are working to develop new knowledge for solving
@@ -287,20 +287,20 @@ all available knowledge at an earlier stage in the research process.
287
287
Developing machine learning knowledge using open science means that
288
288
publications, data, results, and software is accessible without borders
289
289
for everyone to learn and build upon. Key pillars of open science
290
- important for open machine learning are:
290
+ important that are for open machine learning are:
291
291
292
- - Open Data:
293
- - Open source software
294
- - Open access
292
+ * Open Data
293
+ * Open source software
294
+ * Open access
295
295
296
- This so everyone can validate claims, inspect algorithms used and can
297
- created and read machine learning experiments done without large upfront
296
+ Everyone should be able to validate claims, inspect algorithms used and can
297
+ created and read machine learning experiments. All without large upfront
298
298
costs. Transparency is needed for trust. This also accounts for machine
299
299
learning applications, algorithms and frameworks used.
300
300
301
301
For real open machine learning applications providing real transparency
302
302
in terms of explaining how results are created is a complex problem.
303
- This is a direct result of how some type of machine learning algorithms
303
+ This is a direct result of how some types of machine learning algorithms
304
304
work. The current generation of machine learning systems offer
305
305
tremendous benefits, but their effectiveness is limited by the machine's
306
306
inability to explain its decisions and actions to users. The so called
@@ -311,21 +311,19 @@ Only when the basic principles for open science are followed, trust in
311
311
machine learning algorithms and software frameworks is possible.
312
312
313
313
The key of machine learning is smart algorithms. Algorithms that operate
314
- as "black boxes" should never be trusted. Fighting against e.g. your
315
- government is very difficult is no insight in the used algorithms. Open
314
+ as "black boxes" should never be trusted. Fighting against your
315
+ government is very difficult if you have no insight in the used algorithms. Open
316
316
algorithms developed in an open scientific environment are key for
317
317
trust.
318
318
319
319
FOSS machine learning with the use of open algorithms is needed to
320
320
prevent a "black box society". That is a society" in which key moments
321
321
of our lives are mediated by unknown, unseen, and arbitrary algorithms.
322
322
Open algorithms and algorithmic accountability is a way to stop this
323
- pattern. An open algorithm makes it possible for anyone to analyse.
324
- There is a freely available description and a FOSS reference
325
- implementation.
323
+ pattern. An open algorithm makes it possible for anyone to analyse.
324
+
325
+ ## Open architectures
326
326
327
- Open architectures
328
- ------------------
329
327
330
328
Architecture is a minefield. Architecture is not by definition high
331
329
level and sometimes relevant details are of the utmost importance. It is
@@ -342,29 +340,29 @@ solutions by working using an agile method should reinforces each other.
342
340
343
341
Open architectures should be concentrated around the following pillars:
344
342
345
- - Solutions should be created using FOSS system building blocks.
346
- - The created architecture blueprint is available for everyone. so use
343
+ * Solutions should be created using FOSS system building blocks.
344
+ * The created architecture blueprint is available for everyone. so use
347
345
a friendly (creative commons) license.
348
- - The architecture is developed from an open process in which everyone
346
+ * The architecture is developed in an open process in which everyone
349
347
participates to improve the architecture. E.g. also customers,
350
348
business stakeholders other stakeholders that will be impacted by
351
349
the architecture design in future. Borders that hinder participation
352
350
should be removed.
353
- - The architecture is based around good usable standards that anyone
351
+ * The architecture is based around good usable standards that anyone
354
352
can and may implement, use and improve. Unfortunate not all open
355
353
standards are really open and usable.
356
354
357
355
![ Open Architecture] ( /images/open-architecture.png )
358
356
359
- Green ML
360
- --------
357
+
358
+ ## Green ML
361
359
362
360
Applying new technology brings new responsibilities. Computations power
363
361
needed for deep learning research have been doubling every few months.
364
362
Machine learning computations can have a very large carbon footprint.
365
363
This is a results of the way most algorithms are designed.
366
364
367
- Most machine learning algorithms give only good results when large
365
+ Almost all machine learning algorithms give only good results when large
368
366
amounts of data are used and an enormous number of calculations are
369
367
performed. Computers do use a lot of energy when calculations at large
370
368
are performed.
@@ -375,7 +373,7 @@ computations can make it difficult for academics, students, and
375
373
researchers, in particular those from emerging economies, to engage in
376
374
deep learning research.
377
375
378
- Green machine learning means is machine learning optimized to minimize
376
+ Green machine learning means machine learning that is optimized to minimize
379
377
resource utilization and environmental impact. This can be done by data
380
378
center resource optimization, balancing training data requirements
381
379
versus accuracy, choosing less resource intensive models or in some
@@ -387,7 +385,7 @@ can bring should not harm the environment of all living cells that have
387
385
no direct relationship with your machine learning application.
388
386
389
387
The Freedom to use the powerful machine learning technology should not
390
- limit the freedom to live in good health of others. So green ML is a
388
+ limit the freedom to live in good health for others. So green ML is a
391
389
difficult but important aspects for machine learning developments. So
392
390
chose algorithms that perform well without weeks of calculation on
393
391
datasets. Or make sure expensive and time consuming calculations can be
0 commit comments