Which bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???違??矣?き碍??釗??循??鶯?? 00111111001111110011111110001000111000010011111100111111111000011110000100111111100000101010101110001010010101100011111100111111111110111011101100111111001111111000111101111010001111110011111111101001111100100011111100111111 3f3f3f88e13f3fe1e13f82ab8a563f3ffbbb3f3f8f7a3f3fe9f23f3f
EUC-JP ???違??矣?き碍?Ŧ釗??循??鶯?? 00111111001111110011111110110000111000110011111100111111111000101110001100111111101001001010110110110011101101110011111110001111101010011010111110001111111000111010011000111111001111111011110111011011001111110011111111110010111101000011111100111111 3f3f3fb0e33f3fe2e33fa4adb3b73f8fa9af8fe3a63f3fbddb3f3ff2f43f3f
UTF-8 玲곷씭違띷뤃矣낅き碍⑸Ŧ釗겼춢循뗫걤鶯뱁꼩 1110111110100110101011011110101010110011101101111110110010010100101011011110100110000001100101011110101110011101101101111110101110100100100000111110011110011111101000111110101110000010100001011110001110000001100011011110011110100010100011011110001010010001101110001100010110100110111010011000011110010111111010101011001010111100111011001011011010100010111001011011111010101010111010111001011110101011111010101011000110100100111010011011011010101111111010111011000110000001111010101011110010101001 efa6adeab3b7ec94ade98195eb9db7eba483e79fa3eb8285e3818de7a28de291b8c5a6e98797eab2bcecb6a2e5beaaeb97abeab1a4e9b6afebb181eabca9
UHC 玲곷씭違띷뤃矣낅き碍⑸Ŧ釗겼춢循뗫걤鶯뱁꼩 111001111011111110000001111010111001110110111110111010101101111010001101111001101000111110110100111010111111100010000101111010111010101010101101111001001111010010101001111010111010100010101110111000011111001010110000111001011010110110000011111000101110000010001011111010111000000110001101111001011010001110111001111011011000010010000110 e7bf81eb9dbeeade8de68fb4ebf885ebaaade4f4a9eba8aee1f2b0e5ad83e2e08beb818de5a3b9ed8486

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
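
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The names CHARSETS, encode_row and encode_table are hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to cp932, UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the table, though the exact output for rare characters may differ from the tool's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as noted above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC (Unified Hangul Code) = CP949
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("き").items():
        print(label, bits, hex_string)
```

For the single character "き", this sketch yields e3818d in UTF-8, 82ab in CP932 and a4ad in EUC-JP, the same byte sequences that appear for that character in the rows above.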