What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ???震?菖???}???震?菖???{^ 00111111001111110011111110010000011010110011111110001111110100100011111100111111001111110111110100111111001111110011111110010000011010110011111110001111110100100011111100111111001111110111101101011110 3f3f3f906b3f8fd23f3f3f7d3f3f3f906b3f8fd23f3f3f7b5e
EUC-JP ???震?菖???}???震?菖???{^ 00111111001111110011111110111111110011000011111110111110110101000011111100111111001111110111110100111111001111110011111110111111110011000011111110111110110101000011111100111111001111110111101101011110 3f3f3fbfcc3fbed43f3f3f7d3f3f3fbfcc3fbed43f3f3f7b5e
UTF-8 앉렻샬震롌菖렚롉롉}앉렻샬震롌菖렚롉롉{^ 111011001001010110001001111010111010000010111011111011001000001110101100111010011001110010000111111010111010000110001100111010001000111110010110111010111010000010011010111010111010000110001001111010111010000110001001011111011110110010010101100010011110101110100000101110111110110010000011101011001110100110011100100001111110101110100001100011001110100010001111100101101110101110100000100110101110101110100001100010011110101110100001100010010111101101011110 ec9589eba0bbec83ace99c87eba18ce88f96eba09aeba189eba1897dec9589eba0bbec83ace99c87eba18ce88f96eba09aeba189eba1897b5e
UHC 앉렻샬震롌菖렚롉롉}앉렻샬震롌菖렚롉롉{^ 101111101100100110001110110000111011110010100011111100101110100010001110110100101111001111101110100011101010110110001110110011111000111011001111011111011011111011001001100011101100001110111100101000111111001011101000100011101101001011110011111011101000111010101101100011101100111110001110110011110111101101011110 bec98ec3bca3f2e88ed2f3ee8ead8ecf8ecf7dbec98ec3bca3f2e88ed2f3ee8ead8ecf8ecf7b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
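
A table like the one above can be approximated with a short script. The following is only a minimal sketch in Python, not the converter's actual implementation. The mapping from charset labels to Python codec names (cp932 for SJIS-Win, cp949 for UHC), the helper names encode_row and convert, and the choice to substitute "?" for characters a charset cannot represent are all assumptions made for illustration.

```python
# Hypothetical mapping from the charset labels in the table to Python codec
# names. cp932 standing in for SJIS-Win and cp949 for UHC are assumptions.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in codec."""
    # errors="replace" turns any unencodable character into "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("震{^").items():
        print(label, binary, hexadecimal)
```

ASCII characters such as "{" and "^" come out as the same single byte in all five charsets, while a character the charset lacks (for example 震 in ISO-8859-1) shows up as 3f, which matches the runs of "?" in the table.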