To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 昻??竊??猷⑤?如???g?諛??癲??肉? 1111101011010000001111110011111111100010100001100011111100111111100101110101000110000111010001000011111110010100010000000011111100111111001111111000001010000111001111111110011010000111001111110011111111100001100111110011111100111111100100111111011100111111 fad03f3fe2863f3f975187443f94403f3f3f82873fe6873f3fe19f3f3f93f73f
EUC-JP ???竊??猷??如??靷g?諛??癲??肉? 0011111100111111001111111110001111100110001111110011111111001101101100100011111100111111110001111010000100111111001111111000111111100111101111011010001111100111001111111110101111100111001111110011111111100010101000010011111100111111110001101111100100111111 3f3f3fe3e63f3fcdb23f3fc7a13f3f8fe7bda3e73febe73f3fe2a13f3fc6f93f
UTF-8 昻뉗떜竊숂몭猷⑤쇊如붿슖靷g뙴諛댁꽠癲쒖슜肉풟 111001101001100010111011111010111000100110010111111010111001011010011100111001111010101110001010111011001000100010000010111010111010101010101101111001111000110010110111111000101001000110100100111011001000011110001010111001011010011010000010111010111011011010111111111011001000101010010110111010011001110110110111111011111011110110000111111010111001100110110100111010001010101110011011111010111000110010000001111010101011110110100000111001111001100110110010111011001001001010010110111011001000101010011100111010001000001010001001111011011001001010011111 e698bbeb8997eb969ce7ab8aec8882ebaaade78cb7e291a4ec878ae5a682ebb6bfec8a96e99db7efbd87eb99b4e8ab9beb8c81eabda0e799b2ec9296ec8a9ce88289ed929f
UHC 昻뉗떜竊숂몭猷⑤쇊如붿슖靷g뙴諛댁꽠癲쒖슜肉풟 11100100111010011000011111101100100010111011001011101111101111001001100111100111100100011001011111101011101000111010100011101011100110011011110011100101111111011001010011101100100110101010010111101100111001101010001111100111100011001011011111101011101100001011010011101100100001001010110111101111101001101001110011101100100110101010100111101011101111111011111101000010 e4e987ec8bb2efbc99e79197eba3a8eb99bce5fd94ec9aa5ece6a3e78cb7ebb0b4ec84adefa69cec9aa9ebbfbf42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)