To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 除??錚???諸媛????錚???諸姨? 100011111001110000111111001111111110100001000010001111110011111100111111100011111001010010010101010100010011111100111111001111110011111111101000010000100011111100111111001111111000111110010100100110110100100000111111 8f9c3f3fe8423f3f3f8f9495513f3f3f3fe8423f3f3f8f949b483f
EUC-JP 除??錚???諸媛????錚???諸姨? 101111011111110000111111001111111110111110100011001111110011111100111111101111011111010011001001101100100011111100111111001111110011111111101111101000110011111100111111001111111011110111110100110101011010100100111111 bdfc3f3fefa33f3f3fbdf4c9b23f3f3f3fefa33f3f3fbdf4d5a93f
UTF-8 除곤성錚댄렎곈諸媛얘렣곤성錚댄렎곈諸姨옜 111010011001100110100100111010101011001110100100111011001000010010110001111010011000110010011010111010111000110010000100111010111010000010001110111010101011001110001000111010001010101110111000111001011010101010011011111011001001011010011000111010111010000010100011111010101011001110100100111011001000010010110001111010011000110010011010111010111000110010000100111010111010000010001110111010101011001110001000111010001010101110111000111001011010011110101000111011001001100010011100 e999a4eab3a4ec84b1e98c9aeb8c84eba08eeab388e8abb8e5aa9bec9698eba0a3eab3a4ec84b1e98c9aeb8c84eba08eeab388e8abb8e5a7a8ec989c
UHC 除곤성錚댄렎곈諸媛얘렣곤성錚댄렎곈諸姨옜 11110000101101101011000011101111101111001011101011101110101101101011010011101101100011101010010010110000111010011111000010110011111010101011000010111110111010101000111010110100101100001110111110111100101110101110111010110110101101001110110110001110101001001011000011101001111100001011001111101100101010011011111110111111 f0b6b0efbcbaeeb6b4ed8ea4b0e9f0b3eab0beea8eb4b0efbcbaeeb6b4ed8ea4b0e9f0b3eca9bfbf

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
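
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The codec names (`cp932` for SJIS-Win, `cp949` for UHC) and the helper name `encode_table` are assumptions chosen for illustration. Characters that a charset cannot represent are replaced with "?" (byte 0x3f), which is the pattern visible in the ISO-8859-1 row.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as code page 932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as code page 949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("除"):
        print(label, bits, hex_string)
```

For the single character 除, the UTF-8 row should come out as `e999a4`, which matches the first three bytes of the UTF-8 row in the table above.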