To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8意??受??語⑤?日??揄ы?? 11100001100111110011111110000010010101111000100011010011001111110011111110001110111100110011111100111111100011001110101010000111010001000011111110010011111110100011111100111111100111011000100110000100100011010011111100111111 e19f3f825788d33f3f8ef33f3f8cea87443f93fa3f3f9d89848d3f3f
EUC-JP 癲?8意??受??語??日??揄ы?? 111000101010000100111111101000111011100010110000110101010011111100111111101111001111010100111111001111111011100011101100001111110011111111000110111111000011111100111111110110011110100110100111111011010011111100111111 e2a13fa3b8b0d53f3fbcf53f3fb8ec3f3fc6fc3f3fd9e9a7ed3f3f
UTF-8 癲쒕8意덌쭫受쇳뫛語⑤쨪日뉒춯揄ы떜亮 1110011110011001101100101110110010010010100101011110111110111100100110001110011010000100100011111110101110001101100011001110110010101101101010111110010110001111100101111110110010000111101100111110101110101011100110111110100010101010100111101110001010010001101001001110110010101000101010101110011010010111101001011110101110001001100100101110110010110110101011111110011010001111100001001101000110001011111010111001011010011100111011111010010110110111 e799b2ec9295efbc98e6848feb8d8cecadabe58f97ec87b3ebab9be8aa9ee291a4eca8aae697a5eb8992ecb6afe68f84d18beb969cefa5b7
UHC 癲쒕8意덌쭫受쇳뫛語⑤쨪日뉒춯揄ы떜亮 1110111110100110100111001110101110100011101110001110101111110010100010001110111110100111100111111110000111110100101111001110110110010001101110111110010111011110101010001110101110100100100001001110110011101101100001111110011110101101100011001110101011110001101011001110110110001011101100101110010110111001 efa69ceba3b8ebf288efa79fe1f4bced91bbe5dea8eba484eced87e7ad8ceaf1aced8bb2e5b9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)