To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN 単奪続村単奪続村^ 1001001001010000100100100100010010010001101100011001000110111010100100100101000010010010010001001001000110110001100100011011101001011110 9250924491b191ba9250924491b191ba5e
EUC-JP 単奪続村単奪続村^ 1100001110110001110000111010010111000010101100111100001010111100110000111011000111000011101001011100001010110011110000101011110001011110 c3b1c3a5c2b3c2bcc3b1c3a5c2b3c2bc5e
UTF-8 単奪続村単奪続村^ 11100101100011011001100011100101101001011010101011100111101101101001101011100110100111011001000111100101100011011001100011100101101001011010101011100111101101101001101011100110100111011001000101011110 e58d98e5a5aae7b69ae69d91e58d98e5a5aae7b69ae69d915e
UHC ?奪?村?奪?村^ 00111111111101111010110000111111111101011011110100111111111101111010110000111111111101011011110101011110 3ff7ac3ff5bd3ff7ac3ff5bd5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)