Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update contents #25

Merged
merged 1 commit into from
Jul 11, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Binary file added doc/_static/book.jpeg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
16 changes: 8 additions & 8 deletions doc/_toc.yml
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ subtrees:
entries:
- file: ch-big-data-intro/index
entries:
- file: ch-big-data-intro/sec-bigdata
- file: ch-big-data-intro/bigdata
- file: ch-big-data-intro/batch-stream
- file: ch-big-data-intro/technologies
- file: ch-big-data-intro/evolution
Expand Down Expand Up @@ -46,10 +46,10 @@ subtrees:
- file: ch-state-checkpoint/exercise-state
- file: ch-flink-connectors/index
entries:
- file: ch-flink-connectors/Exactly-Once-guarantee
- file: ch-flink-connectors/exactly-once
- file: ch-flink-connectors/custom-source-and-sink
- file: ch-flink-connectors/flink-connector
- file: ch-flink-connectors/exercise-stock-price-data-stream
- file: ch-flink-connectors/exercise-stock-price-with-source-sink
- file: ch-table-sql/index
entries:
- file: ch-table-sql/table-overview
Expand All @@ -62,9 +62,9 @@ subtrees:
- file: ch-table-sql/exercise-iot
- file: ch-deployment-and-configuration/index
entries:
- file: ch-deployment-and-configuration/flink-deployment-and-configuration
- file: ch-deployment-and-configuration/deployment-and-configuration
- file: ch-deployment-and-configuration/configuration-file
- file: ch-deployment-and-configuration/operator-chaining-and-slot-sharing
- file: ch-deployment-and-configuration/flink-command-line-interface-guide
- file: ch-deployment-and-configuration/hadoop-flink
- file: ch-deployment-and-configuration/experiment-job-coding-packaging-and-submission
- file: ch-deployment-and-configuration/operator-chaining-slot-sharing
- file: ch-deployment-and-configuration/cli
- file: ch-deployment-and-configuration/flink-with-hadoop
- file: ch-deployment-and-configuration/exercise-job-submission
File renamed without changes.
2 changes: 1 addition & 1 deletion doc/ch-big-data-intro/exercise-stream-with-kafka.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
(exercise-stream-with-kafka)=
# 案例实战:使用 Kafka 构建文本数据流
# 案例:使用 Kafka 构建文本数据流

尽管本书主题是 Flink,但是对数据流的整个生命周期有一个更全面的认识有助于我们理解大数据和流处理。{numref}`technologies` 简单介绍了 Kafka 这项技术,本节将介绍如何使用 Kafka 构建实时文本数据流,读者可以通过本节了解数据流管道的大致结构:数据生产者源源不断地生成数据流,数据流通过消息队列投递,数据消费者异步地对数据流进行处理。

Expand Down
2 changes: 1 addition & 1 deletion doc/ch-datastream-api/exercise-stock-basic.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
(exercise-stock-basic)=
# 习题 股票数据流处理
# 案例实战:股票数据流处理

:::{note}

Expand Down
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
(flink-command-line-interface-guide)=
(sec-cli)=
# 命令行工具

在生产环境中,Flink 使用命令行工具(Command Line Interface)来管理作业的执行。命令行工具本质上是一个可执行脚本,名为 flink,放置在 Flink 的主目录下的 bin 文件夹中。它的功能主要包括:提交、取消作业,罗列当前正在执行和排队的作业、获取某个作业的信息,设置 Savepoint 等。
Expand Down
2 changes: 1 addition & 1 deletion doc/ch-deployment-and-configuration/configuration-file.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
(configuration-file)=
(sec-configuration-file)=
# 配置文件

在前文的介绍中,我们曾多次提到 Flink 主目录下的 `conf/flink-conf.yaml` 文件,这个文件在作业配置中起到了至关重要的作用。
Expand Down
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
(flink-deployment-and-configuration)=
(sec-deployment-and-configuration)=
# Flink 集群部署模式

当前,信息系统基础设施正在飞速发展,常见的基础设施包括物理机集群、虚拟机集群、容器集群等。为了兼容这些基础设施,Flink 曾在 1.7 版本中做了重构,提出了第 3 章中所示的 Master-Worker 架构,该架构可以兼容几乎所有主流信息系统的基础设施,包括 Standalone 集群、Hadoop YARN 集群或 Kubernetes 集群。
Expand Down
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
(experiment-job-coding-packaging-and-submission)=
# 实验 作业编码、打包与提交
(sec-exercise-job-submission)=
# 案例实战:作业编码、打包与提交

本章的重点是集群部署与作业提交,本实验也与此相关。

Expand Down
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
(hadoop-flink)=
(sec-flink-with-hadoop)=
# 与 Hadoop 集成

Flink 可以和 Hadoop 生态圈的组件紧密结合,比如 9.1 节中提到,Flink 可以使用 YARN 作为资源调度器,或者读取 HDFS、HBase 中的数据。在使用 Hadoop 前,我们需要确认已经安装了 Hadoop,并配置了环境变量 `HADOOP_CONF_DIR`,如下环境变量配置是 Hadoop 安装过程所必需的。
Expand Down
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
(operator-chaining-and-slot-sharing)=
(sec-operator-chaining-slot-sharing)=
# 算子链与槽位共享

在第 3 章中我们曾介绍了算子链和槽位共享的概念。默认情况下,这两个功能都是开启的。
Expand Down
3 changes: 2 additions & 1 deletion doc/ch-flink-connectors/custom-source-and-sink.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,5 @@
(custom-source-and-sink)=
(sec-custom-source-and-sink)=

# 自定义 Source 和 Sink

本节将从原理和实现两个方面来介绍 Flink 的 Source 和 Sink。
Expand Down
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
(Exactly-Once-guarantee)=
(sec-exactly-once)=
# Flink 端到端的 Exactly-Once 保障

## 故障恢复与一致性保障
Expand Down
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
(exercise-stock-price-data-stream)=
# 实验 读取并输出股票价格数据流
(sec-exercise-stock-price-with-source-sink)=
# 案例实战:读取并输出股票价格数据流

经过本章的学习,读者应该基本了解了 Flink Connector 的使用方法,本节我们继续以股票交易场景来模拟数据流的输入和输出。

Expand Down
2 changes: 1 addition & 1 deletion doc/ch-programming-basics/exercise-dev-environment.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
(exercise-dev-environment)=
# 案例实战 Flink 开发环境搭建
# 案例实战Flink 开发环境搭建

本案例实战主要带领读者完成对 Flink 开发环境的搭建。

Expand Down
2 changes: 1 addition & 1 deletion doc/ch-state-checkpoint/exercise-state.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
(exercise-state)=
# 练习:电商用户行为
# 案例实战:电商用户行为

:::{note}

Expand Down
2 changes: 1 addition & 1 deletion doc/ch-system-design/exercise-wordcount.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
(exercise-wordcount)=
# 练习:WordCount
# 案例实战:WordCount

:::{note}

Expand Down
2 changes: 1 addition & 1 deletion doc/ch-table-sql/exercise-iot.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
(exercise-iot)=
# 练习:SQL on IoT
# 案例实战:SQL on IoT

:::{note}

Expand Down
2 changes: 1 addition & 1 deletion doc/ch-time-window/exercise-stock.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
(exercise-stock)=
# 练习:股票价格
# 案例实战:股票价格

:::{note}

Expand Down
22 changes: 7 additions & 15 deletions doc/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,9 @@
:columns: 3
:class: sd-m-auto

```{image} ./_static/book.jpeg
:width: 150px
```

:::

Expand All @@ -19,6 +22,9 @@
% The SVG rendering breaks latex builds for the GitHub badge, so only include in HTML
```{only} html
[![](https://img.shields.io/github/stars/godaai/flink-book-zh?style=for-the-badge)](https://github.com/godaai/flink-book-zh)

[![](https://img.shields.io/badge/JD-%E8%B4%AD%E4%B9%B0%E9%93%BE%E6%8E%A5-red?style=for-the-badge)](https://item.jd.com/13154364.html)

```

:::
Expand All @@ -27,21 +33,7 @@

## 主要作者

::::{grid}
:class-container: text-center
:gutter: 3

:::{grid-item-card}
:link-type: doc
:class-header: bg-light

鲁蔚征
^^^

现就职于中国人民大学,CCF高性能计算专业委员会执行委员,主要研究大数据与机器学习系统。
:::

::::
* 鲁蔚征,现就职于中国人民大学,CCF高性能计算专业委员会执行委员,主要研究大数据与机器学习系统。

## 赞赏与支持

Expand Down
Loading