To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蘂??譯??蘂??怏??蘂??譯??蘂??怏??^ 111001010100000100111111001111111110011010100001001111110011111111100101010000010011111100111111100111001000100100111111001111111110010101000001001111110011111111100110101000010011111100111111111001010100000100111111001111111001110010001001001111110011111101011110 e5413f3fe6a13f3fe5413f3f9c893f3fe5413f3fe6a13f3fe5413f3f9c893f3f5e
EUC-JP 蘂??譯??蘂??怏??蘂??譯??蘂??怏??^ 111010011010001000111111001111111110110010100011001111110011111111101001101000100011111100111111110101111110100100111111001111111110100110100010001111110011111111101100101000110011111100111111111010011010001000111111001111111101011111101001001111110011111101011110 e9a23f3feca33f3fe9a23f3fd7e93f3fe9a23f3feca33f3fe9a23f3fd7e93f3f5e
UTF-8 蘂뚮젚譯볥젉蘂노젾怏⑸꽭蘂뚮젚譯볥젉蘂노젾怏⑸꽪^ 11101000100110001000001011101011100110101010111011101100101000001001101011101000101011011010111111101011101100111010010111101100101000001000100111101000100110001000001011101011100001011011100011101100101000001011111011100110100000001000111111100010100100011011100011101010101111011010110111101000100110001000001011101011100110101010111011101100101000001001101011101000101011011010111111101011101100111010010111101100101000001000100111101000100110001000001011101011100001011011100011101100101000001011111011100110100000001000111111100010100100011011100011101010101111011010101001011110 e89882eb9aaeeca09ae8adafebb3a5eca089e89882eb85b8eca0bee6808fe291b8eabdade89882eb9aaeeca09ae8adafebb3a5eca089e89882eb85b8eca0bee6808fe291b8eabdaa5e
UHC 蘂뚮젚譯볥젉蘂노젾怏⑸꽭蘂뚮젚譯볥젉蘂노젾怏⑸꽪^ 11100111110111101000110011101011101000001001011011100110101110111001001111101011101000001000101111100111110111101011001111101011101000001011000011100100111010001010100111101011100001001011100011100111110111101000110011101011101000001001011011100110101110111001001111101011101000001000101111100111110111101011001111101011101000001011000011100100111010001010100111101011100001001011010101011110 e7de8ceba096e6bb93eba08be7deb3eba0b0e4e8a9eb84b8e7de8ceba096e6bb93eba08be7deb3eba0b0e4e8a9eb84b55e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)