To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??シ癲??應у?? 0011111100111111100000110101011011100001100111110011111100111111100111001110010010000100100001010011111100111111 3f3f8356e19f3f3f9ce484853f3f
EUC-JP ??シ癲??應у?? 0011111100111111101001011011011111100010101000010011111100111111110110001110011010100111111001010011111100111111 3f3fa5b7e2a13f3fd8e6a7e53f3f
UTF-8 琉뗨シ癲욏녃應у쯃若 1110111110100111100011001110101110010111101010001110001110000010101101111110011110011001101100101110110010011010100011111110101110000101100000111110011010000111100010011101000110000011111011001010111110000011111011111010010110110100 efa78ceb97a8e382b7e799b2ec9a8feb8583e68789d183ecaf83efa5b4
UHC 琉뗨シ癲욏녃應у쯃若 1110101110100100100010111110100010101011101101111110111110100110100111101110110110000110101110111110101111101011101011001110010110101000100111111110010110101110 eba48be8abb7efa69eed86bbebebace5a89fe5ae

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
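
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes that Python's `cp932` codec corresponds to SJIS-WIN and `cp949` to UHC, and that unencodable characters are replaced by `?` (0x3f), as the ISO-8859-1 row suggests. The names `CODECS` and `encode_rows` are hypothetical.

```python
# Charset labels from the table mapped to Python codec names.
# The cp932 and cp949 mappings are assumptions, not taken from the tool.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexed in encode_rows("シ"):
        print(label, bits, hexed)
```

For the single character "シ", this sketch yields 8356 for SJIS-WIN, a5b7 for EUC-JP, and e382b7 for UTF-8, byte sequences that also appear in the corresponding rows of the table, while ISO-8859-1 falls back to 3f.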