To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN ?基??姐?? 001111111000101011101110001111110011111110001000101101110011111100111111 3f8aee3f3f88b73f3f
EUC-JP 塼基??姐?? 1000111110111000101110011011010011110000001111110011111110110000101110010011111100111111 8fb8b9b4f03f3fb0b93f3f
UTF-8 塼基렲렏姐브혤 111001011010000110111100111001011001111110111010111010111010000010110010111010111010000010001111111001011010011110010000111010111011100010001100111011011001100010100100 e5a1bce59fbaeba0b2eba08fe5a790ebb88ced98a4
UHC 塼基렲렏姐브혤 1110111011110100110100001111000110001110101111111000111010100101111011101011101110111010111010101100100010100001 eef4d0f18ebf8ea5eebbbaeac8a1

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A "?" (0x3f) in the table marks a character that the charset cannot represent.
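
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the helper name `to_bit_strings` is hypothetical, and the codec names `cp932` (for SJIS-Win) and `cp949` (for UHC) are assumed to be the closest Python equivalents, so a few characters may map differently than in the table. Characters that a charset cannot represent are replaced with "?" (0x3f).

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset and return (binary, hexadecimal) strings.

    Hypothetical helper for illustration. Unencodable characters
    become "?" (0x3f) via errors="replace".
    """
    raw = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "塼基"  # first two characters of the example input
    for label, codec in CHARSETS.items():
        bits, hexa = to_bit_strings(sample, codec)
        print(label, bits, hexa)
```

For the UTF-8 row, the first two characters of the example input yield `e5a1bce59fba`, which matches the start of the hexadecimal column above.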