To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN ??蔚?製?? 001111110011111110001001010101010011111110010000101110110011111100111111 3f3f89553f90bb3f3f
EUC-JP ??蔚?製?頊 0011111100111111101100011011011000111111110000001011110100111111100011111110011111110100 3f3fb1b63fc0bd3f8fe7f4
UTF-8 亐렍蔚렰製렲頊 111001001011101010010000111010111010000010001101111010001001010010011010111010111010000010110000111010001010001110111101111010111010000010110010111010011010000010001010 e4ba90eba08de8949aeba0b0e8a3bdeba0b2e9a08a
UHC 亐렍蔚렰製렲頊 1110101010100111100011101010001111101010101001011000111010111101111100001011001010001110101111111110100111110101 eaa78ea3eaa58ebdf0b28ebfe9f5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)