Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 櫻??異?????夜?????應??罌??苑 10011111010011100011111100111111100010001101100100111111001111110011111100111111001111111001011011101001001111110011111100111111001111110011111110011100111001000011111100111111111000111010000000111111001111111000100110010001 9f4e3f3f88d93f3f3f3f3f96e93f3f3f3f3f9ce43f3fe3a03f3f8991
EUC-JP 櫻??異?????夜?????應??罌??苑 11011101101011110011111100111111101100001101101100111111001111110011111100111111001111111100110011101011001111110011111100111111001111110011111111011000111001100011111100111111111001101010001000111111001111111011000111110001 ddaf3f3fb0db3f3f3f3f3fcceb3f3f3f3f3fd8e63f3fe6a23f3fb1f1
UTF-8 櫻뗣굞異꿨땔琉뷩걧夜껋뮂璘뺠뿥應밤궚罌븍뱽苑 111001101010101110111011111010111001011110100011111010101011010110011110111001111001010110110000111010101011111110101000111010111001010110010100111011111010011110001100111010111011011110101001111010101011000110100111111001011010010010011100111010101011101110001011111010111010111010000010111011111010011110101111111010111011101010100000111010111011111110100101111001101000011110001001111010111011000010100100111010101011011010011010111001111011110110001100111010111011100010001101111010111011000110111101111010001000101110010001 e6abbbeb97a3eab59ee795b0eabfa8eb9594efa78cebb7a9eab1a7e5a49ceabb8bebae82efa7afebbaa0ebbfa5e68789ebb0a4eab69ae7bd8cebb88debb1bde88b91
UHC 櫻뗣굞異꿨땔琉뷩걧夜껋뮂璘뺠뿥應밤궚罌븍뱽苑 1110010110100001100010111110001110000010100001101110110010110110101100101110010110110110101010101110101110100100101110101110001110000001100100001110010110101000100000111110110010010010100100011110110011011110100101011110100010010111101001011110101111101011101110011110001110000010101011111110010110100010101110101110101110010011101000111110101010111101 e5a18be38286ecb6b2e5b6aaeba4bae38190e5a883ec9291ecde95e897a5ebebb9e382afe5a2baeb93a3eabd

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
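
The conversion shown in the table can be approximated offline. Below is a minimal Python sketch, not this page's actual implementation. The names `CHARSETS`, `encode_to_bits`, and `convert` are hypothetical, and the mapping of the table's charset labels to Python codec names (for example `cp932` for SJIS-Win and `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the 3f bytes in the table above.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-Win and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset."""
    return {label: encode_to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("苑").items():
        print(label, binary, hexadecimal)
```

Passing `errors="replace"` keeps the output defined for every charset; using `errors="strict"` instead would raise `UnicodeEncodeError` for characters a charset cannot represent.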