To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 臟????雄???? 111001000110011000111111001111110011111100111111100101110101100100111111001111110011111100111111 e4663f3f3f3f97593f3f3f3f
EUC-JP 臟????雄???? 111001111100011100111111001111110011111100111111110011011011101000111111001111110011111100111111 e7c73f3f3f3fcdba3f3f3f3f
UTF-8 臟렭亐꿜혈雄잿렪몇렏 111010001000011110011111111010111010000010101101111001001011101010010000111010101011111110011100111011011001100010001000111010011001101110000100111011001001111010111111111010111010000010101010111010111010101010000111111010111010000010001111 e8879feba0ade4ba90eabf9ced9888e99b84ec9ebfeba0aaebaa87eba08f
UHC 臟렭亐꿜혈雄잿렪몇렏 1110110111110100100011101011101011101010101001111011001011100100110001111111011111101010101010011100000011101101100011101011100010111000111011101000111010100101 edf48ebaeaa7b2e4c7f7eaa9c0ed8eb8b8ee8ea5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)