To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???侑??嶢?淫? 00111111001111110011111110011000110100000011111100111111100110111101000000111111100010001111101000111111 3f3f3f98d03f3f9bd03f88fa3f
EUC-JP 塡??侑??嶢?淫? 100011111011100010110100001111110011111111010000110100100011111100111111110101101101001000111111101100001111110000111111 8fb8b43f3fd0d23f3fd6d23fb0fc3f
UTF-8 塡재침侑띔퇴嶢렏淫동 111001011010000110100001111011001001111010101100111011001011100110101000111001001011111010010001111010111001110110010100111011011000011110110100111001011011011010100010111010111010000010001111111001101011011110101011111010111000111110011001 e5a1a1ec9eacecb9a8e4be91eb9d94ed87b4e5b6a2eba08fe6b7abeb8f99
UHC 塡재침侑띔퇴嶢렏淫동 1110111011110011110000001110011111000100101001111110101011100010101101101110101011000101111100001110100011110010100011101010010111101011111000101011010110111111 eef3c0e7c4a7eae2b6eac5f0e8f28ea5ebe2b5bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)