To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????TB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5442
SJIS-WIN 鸚??????ο?邑?????TB 1110101001011111001111110011111100111111001111110011111100111111100000111100110100111111100101110101011100111111001111110011111100111111001111110101010001000010 ea5f3f3f3f3f3f3f83cd3f97573f3f3f3f3f5442
EUC-JP 鸚??????ο?邑?????TB 1111001111000000001111110011111100111111001111110011111100111111101001101100111100111111110011011011100000111111001111110011111100111111001111110101010001000010 f3c03f3f3f3f3f3fa6cf3fcdb83f3f3f3f3f5442
UTF-8 鸚쒓퍓履뗦룚理ο쫱邑곗뫀歷몄눅TB 11101001101110001001101011101100100100101001001111101101100011011001001111101111101001111001111111101011100101111010011011101011101000111001101011101111101001111010010011001110101111111110110010101011101100011110100110000010100100011110101010110011100101111110101110101011100000001110111110100110100011001110101110101010100001001110101110001000100001010101010001000010 e9b89aec9293ed8d93efa79feb97a6eba39aefa7a4cebfecabb1e98291eab397ebab80efa68cebaa84eb88855442
UHC 鸚쒓퍓履뗦룚理ο쫱邑곗뫀歷몄눅TB 1110010110100100100111001110101010111011100010101110110010101010100010111110011010001111100101101110110010110101101001011110111110100110100010011110101111101001101100001110110010010001101001001110011010111000101110001110110010110100101010100101010001000010 e5a49ceabb8aecaa8be68f96ecb5a5efa689ebe9b0ec91a4e6b8b8ecb4aa5442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)