To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 族儡???油???族儡???油???^ 10010001101100001001100101010011001111110011111100111111100101101111101100111111001111110011111110010001101100001001100101010011001111110011111100111111100101101111101100111111001111110011111101011110 91b099533f3f3f96fb3f3f3f91b099533f3f3f96fb3f3f3f5e
EUC-JP 族儡藿??油???族儡藿??油???^ 1100001010110010110100011011010010001111110110101010000100111111001111111100110011111101001111110011111100111111110000101011001011010001101101001000111111011010101000010011111100111111110011001111110100111111001111110011111101011110 c2b2d1b48fdaa13f3fccfd3f3f3fc2b2d1b48fdaa13f3fccfd3f3f3f5e
UTF-8 族儡藿롊렔油쇼렓螺族儡藿롊렔油쇼렓羅^ 11100110100101111000111111100101100001001010000111101000100101111011111111101011101000011000101011101011101000001001010011100110101100101011100111101100100001111011110011101011101000001001001111101111101001001001000111100110100101111000111111100101100001001010000111101000100101111011111111101011101000011000101011101011101000001001010011100110101100101011100111101100100001111011110011101011101000001001001111101111101001001000111101011110 e6978fe584a1e897bfeba18aeba094e6b2b9ec87bceba093efa491e6978fe584a1e897bfeba18aeba094e6b2b9ec87bceba093efa48f5e
UHC 族儡藿롊렔油쇼렓螺族儡藿롊렔油쇼렓羅^ 11110000111010011101011011101101110011101010101110001110110100001000111010101001111010101111101010111100111011101000111010101000110100011101111011110000111010011101011011101101110011101010101110001110110100001000111010101001111010101111101010111100111011101000111010101000110100011101110001011110 f0e9d6edceab8ed08ea9eafabcee8ea8d1def0e9d6edceab8ed08ea9eafabcee8ea8d1dc5e
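
The conversion shown in the table can be approximated in Python. This is only a minimal sketch, not the code behind this page: the codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed equivalents of the charset labels, and `to_bit_strings` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (0x3f), which resembles the `3f` runs in the rows above, but the exact output for unmappable characters may differ from this page's converter.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption, not something the page states.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("族^", codec)
        print(label, binary, hexadecimal)
```

Each byte contributes eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.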

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).