To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????N 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4e
SJIS-WIN ’?嵩?モ’?純????ャ’?戍?ャ??N 100000010110011000111111100100001001001100111111100000111000001010000001011001100011111110001111100000110011111100111111001111110011111110000011100000111000000101100110001111111001110011111001001111111000001110000011001111110011111101001110 81663f90933f838281663f8f833f3f3f3f838381663f9cf93f83833f3f4e
EUC-JP ’?嵩?モ’?純????ャ’?戍?ャ??N 101000011100011100111111101111111111001100111111101001011110001010100001110001110011111110111101111000110011111100111111001111110011111110100101111000111010000111000111001111111101100011111011001111111010010111100011001111110011111101001110 a1c73fbff33fa5e2a1c73fbde33f3f3f3fa5e3a1c73fd8fb3fa5e33f3f4e
UTF-8 ’룶嵩캀モ’룶純◈룫치캀ャ’룶戍캀ャ룫집N 11100010100000001001100111101011101000111011011011100101101101011010100111101100101110101000000011100011100000111010001011100010100000001001100111101011101000111011011011100111101101001001010011100010100101111000100011101011101000111010101111101100101110011001100011101100101110101000000011100011100000111010001111100010100000001001100111101011101000111011011011100110100010001000110111101100101110101000000011100011100000111010001111101011101000111010101111101100101001111001000101001110 e28099eba3b6e5b5a9ecba80e383a2e28099eba3b6e7b494e29788eba3abecb998ecba80e383a3e28099eba3b6e6888decba80e383a3eba3abeca7914e
UHC ’룶嵩캀モ’룶純◈룫치캀ャ’룶戍캀ャ룫집N 1010000110101111100011111010101111100011101000011010111110001111101010111110001010100001101011111000111110101011111000101110110110100010110000101000111110100010110001001010000110101111100011111010101111100011101000011010111110001111101010111110001010100001101011111000111110101011111000111000111110100010110000011111110101001110 a1af8fabe3a1af8fabe2a1af8fabe2eda2c28fa2c4a1af8fabe3a1af8fabe2a1af8fabe38fa2c1fd4e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)