To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ?????????? | 00111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | ???循?│遺??? | 00111111001111110011111110001111011110100011111110000100101000001000100011100010001111110011111100111111 | 3f3f3f8f7a3f84a088e23f3f3f |
| EUC-JP | ???循?│遺??? | 00111111001111110011111110111101110110110011111110101000101000101011000011100100001111110011111100111111 | 3f3f3fbddb3fa8a2b0e43f3f3f |
| UTF-8 | 若뱀꼯循븝│遺븍츝若 | 111011111010010110110100111010111011000110000000111010101011110010101111111001011011111010101010111010111011100010011101111000101001010010000010111010011000000110111010111010111011100010001101111011001011100010011101111011111010010110110100 | efa5b4ebb180eabcafe5beaaebb89de29482e981baebb88decb89defa5b4 |
| UHC | 若뱀꼯循븝│遺븍츝若 | 1110010110101110101110011110110010000100100010101110001011100000101110101110111110100110101000101110101110110110101110101110101110101110100101101110010110101110 | e5aeb9ec848ae2e0baefa6a2ebb6baebae96e5ae |
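The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation. The codec names below are assumptions about which Python codecs correspond to each charset label (`cp932` for SJIS-WIN, `cp949` for UHC), and the helper name `encode_row` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` runs in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    # Print one row per charset for a single sample character.
    for name, codec in CHARSETS.items():
        binary, hexadecimal = encode_row("循", codec)
        print(name, binary, hexadecimal)
```

Python's codecs may differ from the tool's converter for some vendor-specific characters, so results for unusual input are not guaranteed to match the table byte for byte.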

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).