Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 移??絲??无諸鞨? 100010001101101000111111001111111110001101001110001111110011111110011101110110011000111110010100111010001110000000111111 88da3f3fe34e3f3f9dd98f94e8e03f
EUC-JP 移??絲??无諸鞨? 101100001101110000111111001111111110010110101111001111110011111111011010110110111011110111110100111100001110001000111111 b0dc3f3fe5af3f3fdadbbdf4f0e23f
UTF-8 移쇨렚絲렠쇨无諸鞨렮 111001111010011110111011111011001000011110101000111010111010000010011010111001111011010110110010111010111010000010100000111011001000011110101000111001101001011110100000111010001010101110111000111010011001111010101000111010111010000010101110 e7a7bbec87a8eba09ae7b5b2eba0a0ec87a8e697a0e8abb8e99ea8eba0ae
UHC 移쇨렚絲렠쇨无諸鞨렮 1110110010111001101111001110101010001110101011011101111011101010100011101011000110111100111010101101100111101001111100001011001111001010111010101000111010111011 ecb9bcea8eaddeea8eb1bcead9e9f0b3caea8ebb

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
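
The rows of the table above can be approximated offline with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches what the table shows for ISO-8859-1.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" (byte 0x3f) instead of raising an error.
    data = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the round trip.
    shown = data.decode(codec, errors="replace")
    # Eight binary digits per byte, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in encode_table("移").items():
        print(label, shown, bits, hexstr)
```

Python's codecs may differ from the converter's own conversion tables for a small number of vendor-specific characters, so unusual inputs can produce rows that do not match the page exactly.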