Skip to content

Troubleshooting IPv6 HTTP(S) deployment #166

Open
@Kangie

Description

@Kangie

Hi Team,

I'm trying to get the most basic deployment of Confluent going with two servers directly connected via ethernet cables (no switches or anything in the way).

Our desired configuration will use HTTP boot over UEFI, so I am attempting to set this up without DNS or DHCP in the first instance to trial node deployment (and image management) before we scale out to one of our smaller clusters.

I have been able to define my node, automatically discover it's MAC, and assign that MAC to the the defined node (I have plugged in a second interface to make sure the shared ILO port is not the issue here). My deployment interface(s) (not defined anywhere...) have IPv6 enabled and link-local (FE80) addresses.

root@headnode:~# nodediscover list
 Node| Model| Serial|                                 UUID|       Mac Address|       Type| Current IP Addresses
-----|------|-------|-------------------------------------|------------------|-----------|---------------------
   t1|      |       | 36334c44-4730-xxxx-yyyy-4e50ax373134| xx:xx:xx:d2:b4:c0| pxe-client|                     
   t1|      |       | 36334c44-4730-xxxx-yyyy-4e50ax373134| xx:xx:xx:d2:b4:c1| pxe-client|                     

I have defined the client node to boot only from HTTP(S) UEFI options.

I see the following via tcpdump of the deployment interface (limited by node MAC) when the node attempts to HTTP boot off that port:

tcpdump -i eno2 -nnnnn  ether host xx:xx:xx:d2:b4:c1
. . .
16:38:04.802851 IP6 fe80::2267:7cff:fed2:b4e1.547 > fe80::2267:7cff:fed2:b4c1.546: dhcp6 [|dhcp6]
16:38:08.951124 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:38:09.749800 IP6 fe80::2267:7cff:fed2:b4c1 > fe80::2267:7cff:fed2:b4e1: ICMP6, neighbor solicitation, who has fe80::2267:7cff:fed2:b4e1, length 32
16:38:09.749839 IP6 fe80::2267:7cff:fed2:b4e1 > fe80::2267:7cff:fed2:b4c1: ICMP6, neighbor advertisement, tgt is fe80::2267:7cff:fed2:b4e1, length 24
16:38:15.169719 IP6 fe80::2267:7cff:fed2:b4e1 > fe80::2267:7cff:fed2:b4c1: ICMP6, neighbor solicitation, who has fe80::2267:7cff:fed2:b4c1, length 32
16:38:15.169851 IP6 fe80::2267:7cff:fed2:b4c1 > fe80::2267:7cff:fed2:b4e1: ICMP6, neighbor advertisement, tgt is fe80::2267:7cff:fed2:b4c1, length 32
16:38:17.950750 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:38:34.950175 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:40:25.807578 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:40:29.943707 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:40:38.943350 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:40:55.942762 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:42:46.786935 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:42:50.936298 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:42:59.935936 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:43:16.935357 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:45:07.779679 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:45:07.780750 IP6 fe80::2267:7cff:fed2:b4c1 > fe80::2267:7cff:fed2:b4e1: ICMP6, neighbor advertisement, tgt is fe80::2267:7cff:fed2:b4c1, length 32
16:45:07.780774 IP6 fe80::2267:7cff:fed2:b4e1.547 > fe80::2267:7cff:fed2:b4c1.546: dhcp6 [|dhcp6]
16:45:11.928902 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:45:12.727582 IP6 fe80::2267:7cff:fed2:b4c1 > fe80::2267:7cff:fed2:b4e1: ICMP6, neighbor solicitation, who has fe80::2267:7cff:fed2:b4e1, length 32
16:45:12.727606 IP6 fe80::2267:7cff:fed2:b4e1 > fe80::2267:7cff:fed2:b4c1: ICMP6, neighbor advertisement, tgt is fe80::2267:7cff:fed2:b4e1, length 24
16:45:20.928531 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:45:37.927937 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:45:37.928998 IP6 fe80::2267:7cff:fed2:b4c1 > fe80::2267:7cff:fed2:b4e1: ICMP6, neighbor advertisement, tgt is fe80::2267:7cff:fed2:b4c1, length 32
16:45:37.929023 IP6 fe80::2267:7cff:fed2:b4e1.547 > fe80::2267:7cff:fed2:b4c1.546: dhcp6 [|dhcp6]
16:45:42.925999 IP6 fe80::2267:7cff:fed2:b4c1 > fe80::2267:7cff:fed2:b4e1: ICMP6, neighbor solicitation, who has fe80::2267:7cff:fed2:b4e1, length 32
16:45:42.926021 IP6 fe80::2267:7cff:fed2:b4e1 > fe80::2267:7cff:fed2:b4c1: ICMP6, neighbor advertisement, tgt is fe80::2267:7cff:fed2:b4e1, length 24
16:47:28.784301 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:47:32.922494 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:47:41.921126 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit
16:47:58.920540 IP6 fe80::2267:7cff:fed2:b4c1.546 > ff02::1:2.547: dhcp6 solicit

Confluent appears to be listening on the DHCP6 port:

root@headnode:~# ss -tulpn
Netid           State            Recv-Q           Send-Q                     Local Address:Port                       Peer Address:Port           Process                                                                                                                                           
udp             UNCONN           0                0                                0.0.0.0:44939                           0.0.0.0:*               users:(("avahi-daemon",pid=920,fd=14))                                                                                                           
udp             UNCONN           0                0                             127.0.0.54:53                              0.0.0.0:*               users:(("systemd-resolve",pid=910,fd=16))                                                                                                        
udp             UNCONN           0                0                          127.0.0.53%lo:53                              0.0.0.0:*               users:(("systemd-resolve",pid=910,fd=14))                                                                                                        
udp             UNCONN           0                0                                0.0.0.0:67                              0.0.0.0:*               users:(("confluent",pid=1395,fd=15))                                                                                                             
udp             UNCONN           0                0                                0.0.0.0:427                             0.0.0.0:*               users:(("confluent",pid=1395,fd=14))                                                                                                             
udp             UNCONN           0                0                                0.0.0.0:631                             0.0.0.0:*               users:(("cups-browsed",pid=2055,fd=7))                                                                                                           
udp             UNCONN           0                0                                0.0.0.0:1900                            0.0.0.0:*               users:(("confluent",pid=1395,fd=26))                                                                                                             
udp             UNCONN           0                0                                0.0.0.0:4011                            0.0.0.0:*               users:(("confluent",pid=1395,fd=19))                                                                                                             
udp             UNCONN           0                0                                0.0.0.0:5353                            0.0.0.0:*               users:(("avahi-daemon",pid=920,fd=12))                                                                                                           
udp             UNCONN           0                0                                   [::]:41150                              [::]:*               users:(("avahi-daemon",pid=920,fd=15))                                                                                                           
udp             UNCONN           0                0                                      *:59932                                 *:*               users:(("confluent",pid=1395,fd=20))                                                                                                             
udp             UNCONN           0                0                                   [::]:427                                [::]:*               users:(("confluent",pid=1395,fd=13))                                                                                                             
udp             UNCONN           0                0                                      *:547                                   *:*               users:(("confluent",pid=1395,fd=16))                                                                                                             
udp             UNCONN           0                0                                   [::]:1900                               [::]:*              

And the node is primed for deployment:

# nodedeploy t1
t1: pending: ubuntu-24.04.1-x86_64-default (node authentication armed)

I never see any response from my Confluent head node telling this node where to boot. I assume this is all supposed to happen over Layer 2 IPv6 "magic" based on the sparse confluent docs that exist. What have I missed here?

Edit:

Head Node Details:

  • Ubuntu 24.04
  • Confluent installed via Apt:
confluent-server/unknown,now 3.11.1-1 all [installed,automatic]
  confluent systems management server
  • No firewall enabled on the host.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions