To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 臍?蹈??臍?蹈??^ 111001000110000000111111111001110100010000111111001111111110010001100000001111111110011101000100001111110011111101011110 e4603fe7443f3fe4603fe7443f3f5e
EUC-JP 臍?蹈??臍?蹈??^ 111001111100000100111111111011011010010100111111001111111110011111000001001111111110110110100101001111110011111101011110 e7c13feda53f3fe7c13feda53f3f5e
UTF-8 臍면蹈렺렋臍면蹈렺렋^ 11101000100001111000110111101011101010011011010011101000101110011000100011101011101000001011101011101011101000001000101111101000100001111000110111101011101010011011010011101000101110011000100011101011101000001011101011101011101000001000101101011110 e8878deba9b4e8b988eba0baeba08be8878deba9b4e8b988eba0baeba08b5e
UHC 臍면蹈렺렋臍면蹈렺렋^ 111100001011000010111000111010011101010010110000100011101100001010001110101000101111000010110000101110001110100111010100101100001000111011000010100011101010001001011110 f0b0b8e9d4b08ec28ea2f0b0b8e9d4b08ec28ea25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)