Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 簇絅?夷???存??裁 11100010110001101110001101000100001111111000100011001110001111110011111100111111100100011011011000111111001111111000110111011001 e2c6e3443f88ce3f3f3f91b63f3f8dd9
EUC-JP 簇絅?夷???存??裁 11100100110010001110010110100101001111111011000011010000001111110011111100111111110000101011100000111111001111111011101011011011 e4c8e5a53fb0d03f3f3fc2b83f3fbadb
UTF-8 簇絅섯夷댓렰렱存띱밸裁 111001111011000010000111111001111011010110000101111011001000010010101111111001011010010010110111111010111000110010010011111010111010000010110000111010111010000010110001111001011010110110011000111010111001110110110001111010111011000010111000111010001010001110000001 e7b087e7b585ec84afe5a4b7eb8c93eba0b0eba0b1e5ad98eb9db1ebb0b8e8a381
UHC 簇絅섯夷댓렰렱存띱밸裁 11110000111010101100110011100111101111001011100011101100101010001011010011110001100011101011110110001110101111101111000011101101101101101111000010111001111010111110111010101110 f0eacce7bcb8eca8b4f18ebd8ebef0edb6f0b9ebeeae

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
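
A conversion like the one in the table can be roughly reproduced in Python. This is only a sketch, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed equivalents of the labels above, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns each unencodable character into b"?" (0x3f).
    raw = text.encode(codec, errors="replace")
    # Render every byte as 8 binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset of CODECS, keyed by the table label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("簇").items():
        print(label, bits, hexa)
```

For the first character of the example input, 簇, the UTF-8 row of this sketch gives e7b087, the same three bytes that begin the UTF-8 row of the table. The other rows may differ in detail from the table if the page's charsets are not exactly the Python codecs assumed here.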