What bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???b?????????b??????^ 001111110011111100111111011000100011111100111111001111110011111100111111001111110011111100111111001111110110001000111111001111110011111100111111001111110011111101011110 3f3f3f623f3f3f3f3f3f3f3f3f623f3f3f3f3f3f5e
SJIS-WIN 繹??b???鴉??繹??b???鴉??^ 11100011100010000011111100111111011000100011111100111111001111111110100111101011001111110011111111100011100010000011111100111111011000100011111100111111001111111110100111101011001111110011111101011110 e3883f3f623f3f3fe9eb3f3fe3883f3f623f3f3fe9eb3f3f5e
EUC-JP 繹??b???鴉??繹??b???鴉??^ 11100101111010000011111100111111011000100011111100111111001111111111001011101101001111110011111111100101111010000011111100111111011000100011111100111111001111111111001011101101001111110011111101011110 e5e83f3f623f3f3ff2ed3f3fe5e83f3f623f3f3ff2ed3f3f5e
UTF-8 繹먮젾b凉붾졁鴉듿ㄷ繹먮젾b凉붾졁鴉듿엳^ 111001111011100110111001111010111010100010101110111011001010000010111110011000101110111110100101101110011110101110110110101111101110110010100001100000011110100110110100100010011110101110010011101111111110001110000100101101111110011110111001101110011110101110101000101011101110110010100000101111100110001011101111101001011011100111101011101101101011111011101100101000011000000111101001101101001000100111101011100100111011111111101100100101111011001101011110 e7b9b9eba8aeeca0be62efa5b9ebb6beeca181e9b489eb93bfe384b7e7b9b9eba8aeeca0be62efa5b9ebb6beeca181e9b489eb93bfec97b35e
UHC 繹먮젾b凉붾졁鴉듿ㄷ繹먮젾b凉붾졁鴉듿엳^ 111001101011101010010000111010111010000010110000011000101110010110111100100101001110101110100000101100101110010010111100100010101110010110100100101001111110011010111010100100001110101110100000101100000110001011100101101111001001010011101011101000001011001011100100101111001000101011100101100111101000100001011110 e6ba90eba0b062e5bc94eba0b2e4bc8ae5a4a7e6ba90eba0b062e5bc94eba0b2e4bc8ae59e885e

SJIS-Win, EUC-JP: legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
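
The conversion shown in the table can be reproduced roughly as follows. This is a minimal sketch, not the page's actual implementation: the function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the mapping from the page's labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to be what the table above does.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("b^", codec)
        print(label, binary, hexadecimal)
```

For ASCII-only input such as "b^", every charset listed here produces the same bytes (62 5e); the results diverge only for characters outside ASCII.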