To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 也ょオ???餓??瘟?????倭 10010110111001111000001011100101100000110100100100111111001111110011111110001001111011000011111100111111111000011000100100111111001111110011111100111111001111111001100001100000 96e782e583493f3f3f89ec3f3fe1893f3f3f3f3f9860
EUC-JP 也ょオ???餓??瘟??孼??倭 110011001110100110100100111001111010010110101010001111110011111100111111101100101110111000111111001111111110000111101001001111110011111110001111101110101100001100111111001111111100111111000001 cce9a4e7a5aa3f3f3fb2ee3f3fe1e93f3f8fbac33f3fcfc1
UTF-8 也ょオ呂묉짎餓뽩뜵瘟루떻孼껃렘倭 111001001011100110011111111000111000001010000111111000111000001010101010111011111010011010000000111010111010110010001001111011001010011110001110111010011010010010010011111010111011110110101001111010111001110010110101111001111001100010011111111010111010001110101000111010111001011010111011111001011010110110111100111010101011101110000011111010111010000010011000111001011000000010101101 e4b99fe38287e382aaefa680ebac89eca78ee9a493ebbda9eb9cb5e7989feba3a8eb96bbe5adbceabb83eba098e580ad
UHC 也ょオ呂묉짎餓뽩뜵瘟루떻孼껃렘倭 1110010110100101101010101110011110101011101010101110010111111011100100011110011010100011100110101110010010111011100101101110010110001101101100111110100010110000101101111110011110110110101110111110010111101101100000111110010110110111101111011110100011011110 e5a5aae7abaae5fb91e6a39ae4bb96e58db3e8b0b7e7b6bbe5ed83e5b7bde8de

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)