To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣i??≫?筌 0011111100111111001111111000101110000011100000101000100100111111001111111000000111100010001111111110001010100011 3f3f3f8b8382893f3f81e23fe2a3
EUC-JP ???泣i?飡≫?筌 00111111001111110011111110110101111000111010001111101001001111111000111111101000110010001010001011100100001111111110010010100101 3f3f3fb5e3a3e93f8fe8c8a2e43fe4a5
UTF-8 捻꿔꺂泣i뜮飡≫맪筌 111011111010011010100100111010101011111110010100111010101011101010000010111001101011001110100011111011111011110110001001111010111001110010101110111010011010001110100001111000101000100110101011111010111010011110101010111001111010110110001100 efa6a4eabf94eaba82e6b3a3efbd89eb9caee9a3a1e289abeba7aae7ad8c
UHC 捻꿔꺂泣i뜮飡≫맪筌 1110011011110111101100101110001110000011101010111110101111101000101000111110100110001101101011101110000111100010101000011110110110010000101100101110111110100111 e6f7b2e383abebe8a3e98daee1e2a1ed90b2efa7

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
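
The page does not say how the table is computed. As a rough sketch, the same kind of conversion can be reproduced in Python. The codec names below are assumptions: Python's `cp932`, `euc_jp`, and `cp949` are taken as approximate counterparts of SJIS-WIN, EUC-JP, and UHC, and may differ from this tool's mappings for some characters. `encode_table` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the 3f bytes in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These are approximations, not necessarily what the tool itself uses.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes b"?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

For the input "あ", the UTF-8 row is e38182, the SJIS-WIN row is 82a0, the EUC-JP row is a4a2, and ISO-8859-1, which has no Japanese characters, falls back to 3f.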