To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???誼?????癲?????蟻??癲??? 0011111100111111001111111000101101100010001111110011111100111111001111110011111111100001100111110011111100111111001111110011111100111111100010110110000100111111001111111110000110011111001111110011111100111111 3f3f3f8b623f3f3f3f3fe19f3f3f3f3f3f8b613f3fe19f3f3f3f
EUC-JP ???誼?????癲?????蟻??癲??? 0011111100111111001111111011010111000011001111110011111100111111001111110011111111100010101000010011111100111111001111110011111100111111101101011100001000111111001111111110001010100001001111110011111100111111 3f3f3fb5c33f3f3f3f3fe2a13f3f3f3f3fb5c23f3fe2a13f3f3f
UTF-8 劣믪럥誼들젏紐껋젔癲뚮챷罹덌쭓蟻숈졄癲쑳살졂 111011111010011010011101111010111010111110101010111010111001111110100101111010001010101010111100111010111001001110100100111011001010000010001111111011111010011110001111111010101011101110001011111011001010000010010100111001111001100110110010111010111001101010101110111011001011000110110111111011111010011110100110111010111000110110001100111011001010110110010011111010001001111110111011111011001000100010001000111011001010000110000100111001111001100110110010111011001001000110110011111011001000001010110100111011001010000110000010 efa69debafaaeb9fa5e8aabceb93a4eca08fefa78feabb8beca094e799b2eb9aaeecb1b7efa7a6eb8d8cecad93e89fbbec8888eca184e799b2ec91b3ec82b4eca182
UHC 劣믪럥誼들젏紐껋젔癲뚮챷罹덌쭓蟻숈졄癲쑳살졂 1110011011101011100100101110110010001110100010001110101111111110101101011110100110100000100100001110101110101010100000111110110010100000100100101110111110100110100011001110101110101010100001001110110010111010100010001110111110100111100010111110101111111100100110011110110010100000101101011110111110100110100111001100111010111011111011001010000010110011 e6eb92ec8e88ebfeb5e9a090ebaa83eca092efa68cebaa84ecba88efa78bebfc99eca0b5efa69ccebbeca0b3

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
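
The conversion shown in the table can be approximated offline with a minimal Python sketch. This is not the page's own implementation: the helper name encode_table and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    rows = []
    for label, codec in CHARSETS.items():
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("誼"):
        print(label, bits, hex_string)
```

For the single character 誼, this sketch yields 8b62 for SJIS-WIN, b5c3 for EUC-JP, and e8aabc for UTF-8, which agrees with the corresponding bytes in the table above; ISO-8859-1 cannot represent the character and yields 3f.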