To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 玉??亦??鳶?? 100010111100101000111111001111111001011010010010001111110011111110010011110011100011111100111111 8bca3f3f96923f3f93ce3f3f
EUC-JP 玉??亦??鳶?? 101101101100110000111111001111111100101111110010001111110011111111000110110100000011111100111111 b6cc3f3fcbf23f3fc6d03f3f
UTF-8 玉뚧솤亦귡굚鳶쀥뎐 111001111000111010001001111010111001101010100111111011001000011010100100111001001011101010100110111010101011011110100001111010101011010110011010111010011011001110110110111011001000000010100101111010111000111010010000 e78e89eb9aa7ec86a4e4baa6eab7a1eab59ae9b3b6ec80a5eb8e90
UHC 玉뚧솤亦귡굚鳶쀥뎐 111010001010110010001100111001101001100110011110111001101011001010000010111010011000001010000010111001101110100110010111111001011011010110101111 e8ac8ce6999ee6b282e98282e6e997e5b5af

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)