To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 除??錚????媛????錚????姨? 10001111100111000011111100111111111010000100001000111111001111110011111100111111100101010101000100111111001111110011111100111111111010000100001000111111001111110011111100111111100110110100100000111111 8f9c3f3fe8423f3f3f3f95513f3f3f3fe8423f3f3f3f9b483f
EUC-JP 除??錚????媛????錚????姨? 10111101111111000011111100111111111011111010001100111111001111110011111100111111110010011011001000111111001111110011111100111111111011111010001100111111001111110011111100111111110101011010100100111111 bdfc3f3fefa33f3f3f3fc9b23f3f3f3fefa33f3f3f3fd5a93f
UTF-8 除곤성錚댄렎곌눴媛얘렣곤성錚댄렎곌눴姨옜 111010011001100110100100111010101011001110100100111011001000010010110001111010011000110010011010111010111000110010000100111010111010000010001110111010101011001110001100111010111000100010110100111001011010101010011011111011001001011010011000111010111010000010100011111010101011001110100100111011001000010010110001111010011000110010011010111010111000110010000100111010111010000010001110111010101011001110001100111010111000100010110100111001011010011110101000111011001001100010011100 e999a4eab3a4ec84b1e98c9aeb8c84eba08eeab38ceb88b4e5aa9bec9698eba0a3eab3a4ec84b1e98c9aeb8c84eba08eeab38ceb88b4e5a7a8ec989c
UHC 除곤성錚댄렎곌눴媛얘렣곤성錚댄렎곌눴姨옜 11110000101101101011000011101111101111001011101011101110101101101011010011101101100011101010010010110000111010101011010010110011111010101011000010111110111010101000111010110100101100001110111110111100101110101110111010110110101101001110110110001110101001001011000011101010101101001011001111101100101010011011111110111111 f0b6b0efbcbaeeb6b4ed8ea4b0eab4b3eab0beea8eb4b0efbcbaeeb6b4ed8ea4b0eab4b3eca9bfbf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)