Description
Hi Team,
I'm trying to get the most basic Confluent deployment going with two servers directly connected via Ethernet cable (no switches or anything else in between).
Our desired configuration will use HTTP boot over UEFI, so I am attempting to set this up without DNS or DHCP in the first instance to trial node deployment (and image management) before we scale out to one of our smaller clusters.
I have been able to define my node, automatically discover its MAC, and assign that MAC to the defined node (I plugged in a second interface to make sure the shared iLO port is not the issue here). My deployment interface(s) (not defined anywhere...) have IPv6 enabled and link-local (fe80::) addresses.
root@headnode:~# nodediscover list
Node| Model| Serial| UUID| Mac Address| Type| Current IP Addresses
-----|------|-------|-------------------------------------|------------------|-----------|---------------------
t1| | | 36334c44-4730-xxxx-yyyy-4e50ax373134| xx:xx:xx:d2:b4:c0| pxe-client|
t1| | | 36334c44-4730-xxxx-yyyy-4e50ax373134| xx:xx:xx:d2:b4:c1| pxe-client|
I have defined the client node to boot only from HTTP(S) UEFI options.
I see the following via tcpdump on the deployment interface (filtered by the node's MAC) when the node attempts to HTTP boot off that port:
tcpdump -i eno2 -nnnnn ether host xx:xx:xx:d2:b4:c1
. . .
16:38:04.802851 IP6 fe80::2267:7cff:fed2:b4e1.547 > fe80::2267:7cff:fed2:b4c1.546: dhcp6 [|dhcp6]
16:38:08.951124 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:38:09.749800 IP6 fe80::2267:7cff:fed2:b4c1 > fe80::2267:7cff:fed2:b4e1: ICMP6, neighbor solicitation, who has fe80::2267:7cff:fed2:b4e1, length 32
16:38:09.749839 IP6 fe80::2267:7cff:fed2:b4e1 > fe80::2267:7cff:fed2:b4c1: ICMP6, neighbor advertisement, tgt is fe80::2267:7cff:fed2:b4e1, length 24
16:38:15.169719 IP6 fe80::2267:7cff:fed2:b4e1 > fe80::2267:7cff:fed2:b4c1: ICMP6, neighbor solicitation, who has fe80::2267:7cff:fed2:b4c1, length 32
16:38:15.169851 IP6 fe80::2267:7cff:fed2:b4c1 > fe80::2267:7cff:fed2:b4e1: ICMP6, neighbor advertisement, tgt is fe80::2267:7cff:fed2:b4c1, length 32
16:38:17.950750 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:38:34.950175 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:40:25.807578 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:40:29.943707 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:40:38.943350 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:40:55.942762 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:42:46.786935 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:42:50.936298 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:42:59.935936 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:43:16.935357 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:45:07.779679 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:45:07.780750 IP6 fe80::2267:7cff:fed2:b4c1 > fe80::2267:7cff:fed2:b4e1: ICMP6, neighbor advertisement, tgt is fe80::2267:7cff:fed2:b4c1, length 32
16:45:07.780774 IP6 fe80::2267:7cff:fed2:b4e1.547 > fe80::2267:7cff:fed2:b4c1.546: dhcp6 [|dhcp6]
16:45:11.928902 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:45:12.727582 IP6 fe80::2267:7cff:fed2:b4c1 > fe80::2267:7cff:fed2:b4e1: ICMP6, neighbor solicitation, who has fe80::2267:7cff:fed2:b4e1, length 32
16:45:12.727606 IP6 fe80::2267:7cff:fed2:b4e1 > fe80::2267:7cff:fed2:b4c1: ICMP6, neighbor advertisement, tgt is fe80::2267:7cff:fed2:b4e1, length 24
16:45:20.928531 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:45:37.927937 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:45:37.928998 IP6 fe80::2267:7cff:fed2:b4c1 > fe80::2267:7cff:fed2:b4e1: ICMP6, neighbor advertisement, tgt is fe80::2267:7cff:fed2:b4c1, length 32
16:45:37.929023 IP6 fe80::2267:7cff:fed2:b4e1.547 > fe80::2267:7cff:fed2:b4c1.546: dhcp6 [|dhcp6]
16:45:42.925999 IP6 fe80::2267:7cff:fed2:b4c1 > fe80::2267:7cff:fed2:b4e1: ICMP6, neighbor solicitation, who has fe80::2267:7cff:fed2:b4e1, length 32
16:45:42.926021 IP6 fe80::2267:7cff:fed2:b4e1 > fe80::2267:7cff:fed2:b4c1: ICMP6, neighbor advertisement, tgt is fe80::2267:7cff:fed2:b4e1, length 24
16:47:28.784301 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:47:32.922494 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:47:41.921126 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:47:58.920540 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
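For reading the capture, it may help to note that both link-local addresses look like EUI-64 addresses derived from the interface MACs, so the `...b4c1` side should be the booting node and `...b4e1` is presumably the head node's interface. A minimal shell sketch of that derivation follows. The function name `mac_to_link_local` and the sample MAC in the usage note are made up for illustration, and it assumes the interface uses EUI-64 rather than a privacy or stable-privacy address.

```shell
# Derive the EUI-64 link-local address that corresponds to a MAC address.
# Hypothetical helper, not part of Confluent or any standard tool.
mac_to_link_local() {
    # Split the MAC into its six octets ($1..$6).
    old_ifs=$IFS; IFS=:
    set -- $1
    IFS=$old_ifs
    # Flip the universal/local bit (0x02) of the first octet.
    first=$(printf '%02x' $(( 0x$1 ^ 0x02 )))
    # Insert ff:fe between octets 3 and 4, then print as 16-bit groups.
    # Leading zeros are dropped per group; no further zero compression is done.
    printf 'fe80::%x:%x:%x:%x\n' "0x${first}$2" "0x${3}ff" "0xfe$4" "0x$5$6"
}
```

For example, `mac_to_link_local 52:54:00:12:34:56` prints `fe80::5054:ff:fe12:3456`, which can then be compared against the addresses in the capture.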
Confluent appears to be listening on the DHCPv6 port (UDP 547):
root@headnode:~# ss -tulpn
Netid State Recv-Q Send-Q Local Address:Port Peer Address:Port Process
udp UNCONN 0 0 0.0.0.0:44939 0.0.0.0:* users:(("avahi-daemon",pid=920,fd=14))
udp UNCONN 0 0 127.0.0.54:53 0.0.0.0:* users:(("systemd-resolve",pid=910,fd=16))
udp UNCONN 0 0 127.0.0.53%lo:53 0.0.0.0:* users:(("systemd-resolve",pid=910,fd=14))
udp UNCONN 0 0 0.0.0.0:67 0.0.0.0:* users:(("confluent",pid=1395,fd=15))
udp UNCONN 0 0 0.0.0.0:427 0.0.0.0:* users:(("confluent",pid=1395,fd=14))
udp UNCONN 0 0 0.0.0.0:631 0.0.0.0:* users:(("cups-browsed",pid=2055,fd=7))
udp UNCONN 0 0 0.0.0.0:1900 0.0.0.0:* users:(("confluent",pid=1395,fd=26))
udp UNCONN 0 0 0.0.0.0:4011 0.0.0.0:* users:(("confluent",pid=1395,fd=19))
udp UNCONN 0 0 0.0.0.0:5353 0.0.0.0:* users:(("avahi-daemon",pid=920,fd=12))
udp UNCONN 0 0 [::]:41150 [::]:* users:(("avahi-daemon",pid=920,fd=15))
udp UNCONN 0 0 *:59932 *:* users:(("confluent",pid=1395,fd=20))
udp UNCONN 0 0 [::]:427 [::]:* users:(("confluent",pid=1395,fd=13))
udp UNCONN 0 0 *:547 *:* users:(("confluent",pid=1395,fd=16))
udp UNCONN 0 0 [::]:1900 [::]:*
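The same check can be scripted. This is only a sketch that assumes the column layout shown above, with the local address and port in the fifth field; `has_udp_listener` is a hypothetical helper name.

```shell
# Succeeds if `ss -ulpn` style output on stdin has a socket bound to the given port.
has_udp_listener() {
    awk -v port="$1" '
        # Field 5 is "Local Address:Port"; compare the text after the last colon.
        { n = split($5, parts, ":"); if (parts[n] == port) found = 1 }
        END { exit !found }
    '
}
```

Usage would be along the lines of `ss -ulpn | has_udp_listener 547 && echo "UDP 547 is bound"`.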
And the node is primed for deployment:
# nodedeploy t1
t1: pending: ubuntu-24.04.1-x86_64-default (node authentication armed)
I never see any response from my Confluent head node telling this node where to boot. I assume this is all supposed to happen over Layer 2 IPv6 "magic", based on the sparse Confluent docs that exist. What have I missed here?
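One detail that may be relevant: the capture does show packets from port 547 on the `...b4e1` address to port 546 on the node, but tcpdump prints them as `[|dhcp6]`, which generally means it could not decode the full payload. A more verbose capture restricted to DHCPv6 might show what those packets contain. The sketch below assumes the same interface name as above; `dhcp6_filter` is a helper name made up here.

```shell
# Build a pcap filter matching DHCPv6 traffic (UDP 546/547) to or from one MAC.
dhcp6_filter() {
    printf 'ether host %s and (udp port 546 or udp port 547)\n' "$1"
}

# Example (needs root): full snap length and verbose decoding.
# tcpdump -i eno2 -nn -vv -s 0 "$(dhcp6_filter xx:xx:xx:d2:b4:c1)"
```

The parentheses in the filter matter: without them, `and` binds tighter than `or`, so the capture would also include port 547 traffic from other hosts.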
Edit:
Head Node Details:
- Ubuntu 24.04
- Confluent installed via Apt:
confluent-server/unknown,now 3.11.1-1 all [installed,automatic]
confluent systems management server
- No firewall enabled on the host.