To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 翁???莊??豚傲 10001001101001010011111100111111001111111110010010110101001111110011111110010011110110001001100011111100 89a53f3f3fe4b53f3f93d898fc
EUC-JP 翁???莊??豚傲 10110010101001110011111100111111001111111110100010110111001111110011111111000110110110101101000011111110 b2a73f3f3fe8b73f3fc6dad0fe
UTF-8 翁골렰렑莊렱뤯豚傲 111001111011111110000001111010101011001110101000111010111010000010110000111010111010000010010001111010001000111010001010111010111010000010110001111010111010010010101111111010001011000110011010111001011000001010110010 e7bf81eab3a8eba0b0eba091e88e8aeba0b1eba4afe8b19ae582b2
UHC 翁골렰렑莊렱뤯豚傲 111010001011101010110000111100011000111010111101100011101010011011101101111101101000111010111110100011111101110111010100110010101110011111101100 e8bab0f18ebd8ea6edf68ebe8fddd4cae7ec

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)