To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 厭??癌?????閻??疫??B 1000100101111101001111110011111110001010111000000011111100111111001111110011111100111111111010001000010100111111001111111000100101110101001111110011111101000010 897d3f3f8ae03f3f3f3f3fe8853f3f89753f3f42
EUC-JP 厭??癌?????閻??疫??B 1011000111011110001111110011111110110100111000100011111100111111001111110011111100111111111011111110010100111111001111111011000111010110001111110011111101000010 b1de3f3fb4e23f3f3f3f3fefe53f3fb1d63f3f42
UTF-8 厭양쥨癌꿩뮈轢녺뼻閻싧겛疫욘펵B 11100101100011101010110111101100100101101001000111101100101001011010100011100111100110011000110011101010101111111010100111101011101011101000100011101111101001101000110111101011100001011011101011101011101111001011101111101001100101101011101111101100100010111010011111101010101100101001101111100111100101101010101111101100100110101001100011101101100011101011010101000010 e58eadec9691eca5a8e7998ceabfa9ebae88efa68deb85baebbcbbe996bbec8ba7eab29be796abec9a98ed8eb542
UHC 厭양쥨癌꿩뮈轢녺뼻閻싧겛疫욘펵B 11100110111101001011111011100111101000101001101011100100110111111011001011100110101110011011111111100110101111001000011011100111100101101011111011100111101000101001101011100101100000011011001011100110101110011011111111100110101111001000011001000010 e6f4bee7a29ae4dfb2e6b9bfe6bc86e796bee7a29ae581b2e6b9bfe6bc8642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)