Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 狎??唯⑨?猷?? 11100000101111100011111100111111100101110100001010000111010010000011111110010111010100010011111100111111 e0be3f3f974287483f97513f3f
EUC-JP 狎??唯??猷?? 111000001100000000111111001111111100110110100011001111110011111111001101101100100011111100111111 e0c03f3fcda33f3fcdb23f3f
UTF-8 狎녴끉唯⑨쭑猷잙븫 111001111000101110001110111010111000010110110100111010111000000110001001111001011001010010101111111000101001000110101000111011001010110110010001111001111000110010110111111011001001111010011001111010111011100010101011 e78b8eeb85b4eb8189e594afe291a8ecad91e78cb7ec9e99ebb8ab
UHC 狎녴끉唯⑨쭑猷잙븫 111001001110010010000110111000111000010110111100111010101110011010101000111011111010011110001001111010111010001110011111111010111001010110010100 e4e486e385bceae6a8efa789eba39feb9594

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
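
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The helper name `encode_row` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes seen in the table, although Python's codecs may not agree with this tool byte for byte on every character.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    # Decoding the encoded bytes shows what survived the conversion.
    shown = raw.decode(codec, errors="replace")
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "あ"  # any character or short string
    for label, codec in CHARSETS.items():
        shown, bits, hexa = encode_row(sample, codec)
        print(label, shown, bits, hexa)
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.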