What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????汚????? 00111111001111110011111100111111001111110011111110001001100110000011111100111111001111110011111100111111 3f3f3f3f3f3f89983f3f3f3f3f
EUC-JP ???旿??汚??倻?? 0011111100111111001111111000111111000001111101000011111100111111101100011111100000111111001111111000111110110001111101100011111100111111 3f3f3f8fc1f43f3fb1f83f3f8fb1f63f3f
UTF-8 樂롳쉬旿삼슉汚꾦댋倻숃쪧 111011111010011010111111111010111010000110110011111011001000100110101100111001101001011110111111111011001000001010111100111011001000101010001001111001101011000110011010111010101011111010100110111010111000110010001011111001011000000010111011111011001000100010000011111011001010101010100111 efa6bfeba1b3ec89ace697bfec82bcec8a89e6b19aeabea6eb8c8be580bbec8883ecaaa7
UHC 樂롳쉬旿삼슉汚꾦댋倻숃쪧 111010001111100110001110111011111011110110101100111001111111101010111011111011111011110110110101111001111111110110000100111010011000100010110100111001011010011010011001111010001010010110100000 e8f98eefbdace7fabbefbdb5e7fd84e988b4e5a699e8a5a0

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A "?" (hexadecimal 3f) in a row marks a character that the charset cannot represent.
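
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's actual implementation. The function names `encode_row` and `charset_table` are hypothetical, and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (3f), which appears to be what the table does.

```python
# Minimal sketch: encode a string in several charsets and show the bytes
# as a bit string (binary) and as hexadecimal.

# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_row(text, codec):
    """Return (bit string, hex string) for text encoded with codec.

    Unencodable characters become "?" (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def charset_table(text):
    """Return {charset label: (bit string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in charset_table("汚").items():
        print(label, bits, hexa)
```

For example, `encode_row("汚", "cp932")` returns `("1000100110011000", "8998")`, and `encode_row("汚", "iso-8859-1")` returns `("00111111", "3f")` because ISO-8859-1 has no code for that character.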