To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 移??絲??无諸鞨 1000100011011010001111110011111111100011010011100011111100111111100111011101100110001111100101001110100011100000 88da3f3fe34e3f3f9dd98f94e8e0
EUC-JP 移??絲??无諸鞨 1011000011011100001111110011111111100101101011110011111100111111110110101101101110111101111101001111000011100010 b0dc3f3fe5af3f3fdadbbdf4f0e2
UTF-8 移쇨렚絲렠쇨无諸鞨 111001111010011110111011111011001000011110101000111010111010000010011010111001111011010110110010111010111010000010100000111011001000011110101000111001101001011110100000111010001010101110111000111010011001111010101000 e7a7bbec87a8eba09ae7b5b2eba0a0ec87a8e697a0e8abb8e99ea8
UHC 移쇨렚絲렠쇨无諸鞨 111011001011100110111100111010101000111010101101110111101110101010001110101100011011110011101010110110011110100111110000101100111100101011101010 ecb9bcea8eaddeea8eb1bcead9e9f0b3caea

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)