To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????語???????????鴉?? 0011111100111111001111110011111100111111001111111000110011101010001111110011111100111111001111110011111100111111001111110011111100111111001111110011111111101001111010110011111100111111 3f3f3f3f3f3f8cea3f3f3f3f3f3f3f3f3f3f3fe9eb3f3f
EUC-JP 倻?????語???????????鴉?? 10001111101100011111011000111111001111110011111100111111001111111011100011101100001111110011111100111111001111110011111100111111001111110011111100111111001111110011111111110010111011010011111100111111 8fb1f63f3f3f3f3fb8ec3f3f3f3f3f3f3f3f3f3f3ff2ed3f3f
UTF-8 倻뽩뀬溜⑸젾語ⓩ뜤溜뽯젙料ㅺ퀗溜깅졁鴉딂콊 111001011000000010111011111010111011110110101001111010111000000010101100111011111010011110001011111000101001000110111000111011001010000010111110111010001010101010011110111000101001001110101001111010111001110010100100111011111010011110001011111010111011110110101111111011001010000010011001111011111010011010111110111000111000010110111010111011011000000010010111111011111010011110001011111010101011100110000101111011001010000110000001111010011011010010001001111010111001010010000010111011001011110110001010 e580bbebbda9eb80acefa78be291b8eca0bee8aa9ee293a9eb9ca4efa78bebbdafeca099efa6bee385baed8097efa78beab985eca181e9b489eb9482ecbd8a
UHC 倻뽩뀬溜⑸젾語ⓩ뜤溜뽯젙料ㅺ퀗溜깅졁鴉딂콊 111001011010011010010110111001011000010110100010111010101111111010101001111010111010000010110000111001011101111010101000111001101000110110100111111010101111111010010110111010111010000010010101111010001111011110100100111010101011001110001100111010101111111010110001111010111010000010110010111001001011110010001010111010001011000110000110 e5a696e585a2eafea9eba0b0e5dea8e68da7eafe96eba095e8f7a4eab38ceafeb1eba0b2e4bc8ae8b186

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)