What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 儀??倚??倚??窈??援?????佚? 1000101101010110001111110011111110011000110111110011111100111111100110001101111100111111001111111110001001110111001111110011111110001001100001110011111100111111001111110011111100111111100110001100001100111111 8b563f3f98df3f3f98df3f3fe2773f3f89873f3f3f3f3f98c33f
EUC-JP 儀??倚??倚??窈??援?????佚? 1011010110110111001111110011111111010000111000010011111100111111110100001110000100111111001111111110001111011000001111110011111110110001111001110011111100111111001111110011111100111111110100001100010100111111 b5b73f3fd0e13f3fd0e13f3fe3d83f3fb1e73f3f3f3f3fd0c53f
UTF-8 儀붾젨倚덈졎倚덈쨼窈욎떤援띠㉣泥몄㉤佚퍫 111001011000010010000000111010111011011010111110111011001010000010101000111001011000000010011010111010111000110110001000111011001010000110001110111001011000000010011010111010111000110110001000111011001010100010111100111001111010101010001000111011001001101010001110111010111001011010100100111001101000111110110100111010111001110110100000111000111000100110100011111011111010011110100011111010111010101010000100111000111000100110100100111001001011110110011010111011011000110110101011 e58480ebb6beeca0a8e5809aeb8d88eca18ee5809aeb8d88eca8bce7aa88ec9a8eeb96a4e68fb4eb9da0e389a3efa7a3ebaa84e389a4e4bd9aed8dab
UHC 儀붾젨倚덈졎倚덈쨼窈욎떤援띠㉣泥몄㉤佚퍫 11101011111100001001010011101011101000001010000011101011111011111000100011101011101000001011101111101011111011111000100011101011101001001001011011101001101000011001111011101100101101101011001011101010101101011011011011101100101010001011010011101100101100101011100011101100101010001011010111101100111010101011110001000010 ebf094eba0a0ebef88eba0bbebef88eba496e9a19eecb6b2eab5b6eca8b4ecb2b8eca8b5eceabc42

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
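
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The function names `to_bit_strings` and `convert` are made up for illustration, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of `3f` in the table.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (byte 0x3f) for any character
    # that the target charset cannot represent.
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Assumed correspondence between the table's labels and Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def convert(text):
    """Return {label: (binary, hex)} for every charset listed above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("儀").items():
        print(label, binary, hexadecimal)
```

For the single character 儀, this sketch should give `e58480` in UTF-8, `8b56` in SJIS-WIN, and `b5b7` in EUC-JP, which are the leading bytes of the corresponding rows above.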