What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ?ο?歪??節? 0011111110000011110011010011111110011000011000110011111100111111100100001101111100111111 3f83cd3f98633f3f90df3f
EUC-JP ?ο?歪??節? 0011111110100110110011110011111111001111110001000011111100111111110000001110000100111111 3fa6cf3fcfc43f3fc0e13f
UTF-8 遼ο슝歪뤻쐥節㎫ 1110111110100111100000111100111010111111111011001000101010011101111001101010110110101010111010111010010010111011111011001001000010100101111001111010111110000000111000111000111010101011 efa783cebfec8a9de6adaaeba4bbec90a5e7af80e38eab
UHC 遼ο슝歪뤻쐥節㎫ 11101001101011001010010111101111101111011011100111101000111000001000111111101001100111001000101011101111101111011010011111100111 e9aca5efbdb9e8e08fe99c8aefbda7e7

SJIS-Win, EUC-JP: Legacy character sets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
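
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation, which is not shown on this page. The codec names are assumptions: Python's `cp932` stands in for SJIS-Win and `cp949` for UHC. The function name `encode_table` is hypothetical, and `errors="replace"` is assumed to mirror the way the table shows unencodable characters as `?` (hex 3f).

```python
# Map the labels used in the table to Python codec names.
# Assumption: cp932 approximates SJIS-Win and cp949 approximates UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent become "?" (0x3f).
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


for label, bits, hex_string in encode_table("節"):
    print(label, bits, hex_string)
```

For the single character 節, this sketch should give e7af80 in UTF-8, 90df in SJIS-Win and c0e1 in EUC-JP, the same byte sequences that appear for that character in the table rows above, while ISO-8859-1 falls back to 3f.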