New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Feature/oap/cluster plugin #1440

Merged

wu-sheng merged 11 commits into apache:6.0 from peng-yongsheng:feature/oap/cluster_plugin

Jul 11, 2018

Member

peng-yongsheng commented Jul 10, 2018

Please answer these questions before submitting the pull request

Why submit this pull request?
Bug fix
New feature provided
Improve performance

New feature or improvement

Describe the details and related test reports.

Cluster management implementation.

peng-yongsheng added 6 commits

July 9, 2018 22:20


          Cluster module implementation finished.

bd20b91


          Add test case of cluster module which implement by zookeeper.

082f9e2


          Use the environment variable to setting dependency component version.

31b0931


          Merge branch '6.0' into feature/oap/cluster_plugin

897cfb7


          Cluster management design doc.

2b65f26


          Rename module and package name.

87fbda5

peng-yongsheng added core feature backend labels

peng-yongsheng added this to the 6.0.0-preview milestone

Member Author

peng-yongsheng commented Jul 10, 2018

Some test cases unfinished, I will supplement them later.

wu-sheng reviewed

View reviewed changes

Member

wu-sheng left a comment

Some suggestions and questions have been added to the comments.

docs/en/concepts-and-designs/OAP-Cluster-Management.md Outdated

		@@ -0,0 +1,39 @@
		# Observability Analysis Platform Cluster Management
		OAP(Observability Analysis Platform) server is a distributed system, services need to find each

Member

wu-sheng Jul 10, 2018

In the document, we are using backend to represent the collector or server. Let's keep it same.

docs/en/concepts-and-designs/OAP-Cluster-Management.md Outdated

+              OAP(Observability Analysis Platform) server is a distributed system, services need to find each
+              other. i.e. a web service needs to find query service, level 1 aggregate service need to find
+              level 2 aggregate service. Cluster management is just a client-side implementation. It must work
+              together with a distributed coordination server, (i.e. Zookeeper, Consul, Kubernetes.) unless

Member

wu-sheng Jul 10, 2018

Should be distributed coordination service. Service is Zookeeper says for itself. Ref https://zookeeper.apache.org/

docs/en/concepts-and-designs/OAP-Cluster-Management.md Outdated

+              ### Cluster Management Plugins
+              By default, OAP server provides two implementations for cluster management, which are standalone
+              and zookeeper. When the applications being monitored are small, you can choose the standalone mode.

Member

wu-sheng Jul 10, 2018

When the scale of services, which are under monitoring, is small,

docs/en/concepts-and-designs/OAP-Cluster-Management.md Outdated

+              ### Cluster Management Plugins
+              By default, OAP server provides two implementations for cluster management, which are standalone
+              and zookeeper. When the applications being monitored are small, you can choose the standalone mode.
+              If they are big, you must choose the cluster mode by zookeeper plugin, or you can implement another

Member

wu-sheng Jul 10, 2018

If they are big - > Otherwise
It(Scale) is big.
Remove ~~another~~

docs/en/concepts-and-designs/OAP-Cluster-Management.md Outdated

+              There are two interfaces defined beforehand in the OAP server core, which are Module register and
+              module query, all the cluster management plugins must implement those two interfaces.
+              * Module Register: When any modules which need to provide services for each other, those modules

Member

wu-sheng Jul 10, 2018

When any module needs to provide services for others, the module must do register through this interface.
The first half of adjustment is about the language only, the second half is that Cluster Management Interface can't guarantee the register will be into service discovery service. Like standalone is no service existed outside, Kubernetes is doing service instance management by itself, no real register happens.

docs/en/concepts-and-designs/OAP-Cluster-Management.md Outdated

+              * Module Register: When any modules which need to provide services for each other, those modules
+              must invoke this interface to register itself into the service discovery server.
+              * Module Query: When any modules which need to call each other, those modules must retrieve the
+              module set by this interface.

Member

wu-sheng Jul 10, 2018

When any modules which need to call each other -> Same as above. No need which. each other means in the same group or set, but in this case, they are not.
The descriptions of Register and Query should be from the provider side. Such as When any module needs to find other services, it can use this interface to retrieve the service instance list in the certain order.

docs/en/concepts-and-designs/OAP-Cluster-Management.md Outdated

+              * Module Query: When any modules which need to call each other, those modules must retrieve the
+              module set by this interface.
+              ### Process Flow Between Client and Cluster Management

Member

wu-sheng Jul 10, 2018

Add the following description:
The client has two ways to connect the backend, one, use the direct link by a set of backend instance endpoint list, or you can use naming service, which considers your given list is just the seed nodes of the whole cluster. The following graph is showing you how naming service works. You need to check the probe documents to know which way(s) is(are) supported.

...main/java/org/apache/skywalking/oap/server/cluster/plugin/zookeeper/ServiceCacheManager.java Outdated

+                  private final Map<String, ServiceCache<InstanceDetails>> serviceCacheMap;
+                  public ServiceCacheManager() {
+                      this.serviceCacheMap = new HashMap<>();

Member

wu-sheng Jul 10, 2018

Are you sure about the OP on put is synchronized already? No concurrency?

Member

wu-sheng Jul 10, 2018

I just can know there should be multiple service names in this Map, but do they just be added during startup?

...ain/java/org/apache/skywalking/oap/server/cluster/plugin/zookeeper/ZookeeperModuleQuery.java

+                  public List<InstanceDetails> query(String moduleName, String providerName) throws ServiceRegisterException {
+                      List<ServiceInstance<InstanceDetails>> serviceInstances = cacheManager.get(NodeNameBuilder.build(moduleName, providerName)).getInstances();
+                      List<InstanceDetails> instanceDetails = new ArrayList<>(serviceInstances.size());

Member

wu-sheng Jul 10, 2018

Is this a high-frequency call? Will query be called everytime you do secondary aggregation? If so, be careful of this ArrayList.

Member Author

peng-yongsheng Jul 11, 2018

The ArrayList is a fixed size set.

apm-collector/apm-collector-boot/src/main/resources/application.yml Outdated

+              cluster:
+                zookeeper:
+                  hostPort: localhost:2181
+                  sessionTimeout: 100000

Member

wu-sheng Jul 10, 2018

Any purpose to change this?

peng-yongsheng added 4 commits

July 11, 2018 10:12


          Fixed the rat check error.

ff6fd23


          Add -X parameter into mvn command to print error detail.

a3fb81a


          Add dependency of puppycrawl for check style.

c94d751


          Fixed check style error.

d88f933

coveralls commented Jul 11, 2018 •

edited

Loading

Coverage increased (+0.6%) to 25.07% when pulling b1b5013 on peng-yongsheng:feature/oap/cluster_plugin into a26e6af on apache:6.0.


          Fixed doc and code issue for the comment by WuSheng.

b1b5013

wu-sheng merged commit dcdfeb1 into apache:6.0

peng-yongsheng deleted the feature/oap/cluster_plugin branch

July 30, 2018 02:34

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

backend core feature