To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???外??汚??節??俉?????外??汚 00111111001111110011111110001010010011110011111100111111100010011001100000111111001111111001000011011111001111110011111111111010011000010011111100111111001111110011111100111111100010100100111100111111001111111000100110011000 3f3f3f8a4f3f3f89983f3f90df3f3ffa613f3f3f3f3f8a4f3f3f8998
EUC-JP ???外??汚??節??俉?????外??汚 0011111100111111001111111011001110110000001111110011111110110001111110000011111100111111110000001110000100111111001111111000111110110001101110110011111100111111001111110011111100111111101100111011000000111111001111111011000111111000 3f3f3fb3b03f3fb1f83f3fc0e13f3f8fb1bb3f3f3f3f3fb3b03f3fb1f8
UTF-8 筽띰슐外뺧쉴汚뉐걞節쏙쉿俉녈깋筽띰슐外뺧쉴汚 111001111010110110111101111010111001110110110000111011001000101010010000111001011010010010010110111010111011101010100111111011001000100110110100111001101011000110011010111010111000100110010000111010101011000110011110111001111010111110000000111011001000111110011001111011001000100110111111111001001011111110001001111010111000010110001000111010101011100110001011111001111010110110111101111010111001110110110000111011001000101010010000111001011010010010010110111010111011101010100111111011001000100110110100111001101011000110011010 e7adbdeb9db0ec8a90e5a496ebbaa7ec89b4e6b19aeb8990eab19ee7af80ec8f99ec89bfe4bf89eb8588eab98be7adbdeb9db0ec8a90e5a496ebbaa7ec89b4e6b19a
UHC 筽띰슐外뺧쉴汚뉐걞節쏙쉿俉녈깋筽띰슐外뺧쉴汚 1110100010100100101101101110111110111101101101101110100011100010100101011110111110111101101011111110011111111101100001111110010110000001100001111110111110111101101111011110111110111101101100101110011111101011101100111110001110000011100010011110100010100100101101101110111110111101101101101110100011100010100101011110111110111101101011111110011111111101 e8a4b6efbdb6e8e295efbdafe7fd87e58187efbdbdefbdb2e7ebb3e38389e8a4b6efbdb6e8e295efbdafe7fd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)