What bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ??????????? | 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | トス耳爾遙シス | 111100001011000011000100111100011000111010111101100011101010100011110000101111111000111010100010111010101010000110111100111100011000111010111101 | f0b0c4f18ebd8ea8f0bf8ea2eaa1bcf18ebd |
| EUC-JP | ?ト?ス耳?爾遙シ?ス | 001111111000111011000100001111111000111010111101101111001010101000111111101111001010010011110100101000111000111010111100001111111000111010111101 | 3f8ec43f8ebdbcaa3fbca4f4a38ebc3f8ebd |
| UTF-8 | トス耳爾遙シス | 111011101000000110101111111011111011111010000100111011101000010010001001111011111011110110111101111010001000000010110011111011101000000110111110111001111000100010111110111010011000000110011001111011111011110110111100111011101000010010001001111011111011110110111101 | ee81afefbe84ee8489efbdbde880b3ee81bee788bee98199efbdbcee8489efbdbd |
| UHC | ????耳?爾遙??? | 0011111100111111001111110011111111101100101111000011111111101100101100111110100110101011001111110011111100111111 | 3f3f3f3fecbc3fecb3e9ab3f3f3f |

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
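
A conversion like the one in the table can be approximated in a few lines of Python. This is a minimal sketch, not the code behind this page: the codec names `cp932` (for SJIS-WIN) and `cp949` (for UHC) are assumed to be close equivalents and may differ from this tool in edge cases, and the names `CHARSETS`, `to_bit_strings`, and `convert` are made up for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes shown in the table.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: cp932 ~ SJIS-WIN and cp949 ~ UHC; exact mappings may differ.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset in CHARSETS."""
    return {name: to_bit_strings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("A").items():
        print(name, binary, hexadecimal)
```

For plain ASCII input such as "A", every charset in this list yields the same single byte (`01000001`, `41`); the results only diverge for characters outside ASCII.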