What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????C??? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000011001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f433f3f3f
SJIS-WIN ??????????????????C??? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000011001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f433f3f3f
EUC-JP ??????????????????C??? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000011001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f433f3f3f
UTF-8 책혨쨀챘짹혙챙혟쩔챌쨍혮챘혨쨉챙혡쨈C책짢혘 11101100101100011000010111101101100110001010100011101100101010001000000011101100101100011001100011101100101001111011100111101101100110001001100111101100101100011001100111101101100110001001111111101100101010011001010011101100101100011000110011101100101010001000110111101101100110001010111011101100101100011001100011101101100110001010100011101100101010001000100111101100101100011001100111101101100110001010000111101100101010001000100001000011111011001011000110000101111011001010011110100010111011011001100010011000 ecb185ed98a8eca880ecb198eca7b9ed9899ecb199ed989feca994ecb18ceca88ded98aeecb198ed98a8eca889ecb199ed98a1eca88843ecb185eca7a2ed9898
UHC 책혨쨀챘짹혙챙혟쩔챌쨍혮챘혨쨉챙혡쨈C책짢혘 11000011101001011100001010010000110000101011001111000011101010111100001010110001110000101000010011000011101011001100001010001001110000101011111111000011101001111100001010111000110000101001010111000011101010111100001010010000110000101011010111000011101011001100001010001010110000101011010001000011110000111010010111000010101010001100001010000011 c3a5c290c2b3c3abc2b1c284c3acc289c2bfc3a7c2b8c295c3abc290c2b5c3acc28ac2b443c3a5c2a8c283
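
The conversion in the table can be approximated with a minimal Python sketch. It is not the site's implementation: the helper name bits_and_hex is hypothetical, and it assumes the tool replaces characters a charset cannot represent with "?" (0x3f), which is what the ISO-8859-1, SJIS-WIN and EUC-JP rows suggest.

```python
def bits_and_hex(text, charset):
    """Encode text in charset and return (binary string, hex string).

    Hypothetical helper, not part of the conversion tool itself.
    """
    # errors="replace" substitutes "?" (0x3f) for any character
    # that the target charset cannot represent.
    raw = text.encode(charset, errors="replace")
    # Render each byte as 8 binary digits and concatenate them.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


# "책" is the first character of the input above; "C" is plain ASCII.
for charset in ("iso-8859-1", "utf-8"):
    binary, hexadecimal = bits_and_hex("책C", charset)
    print(charset, binary, hexadecimal)
```

Under these assumptions, ISO-8859-1 yields 3f43 (the Hangul syllable becomes "?", while "C" stays 43) and UTF-8 yields ecb18543, matching the corresponding bytes in the table.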

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
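
To illustrate how these two charsets differ, the short sketch below encodes the same Japanese character in each. It assumes Python's "cp932" and "euc_jp" codecs correspond to the SJIS-Win and EUC-JP labels used here, and the sample character "あ" is chosen only for illustration.

```python
# Encode one hiragana character in both Japanese charsets and in UTF-8.
# The codec names are Python's; mapping them to the labels above is an assumption.
sample = "あ"

encoded = {name: sample.encode(name).hex() for name in ("cp932", "euc_jp", "utf-8")}

for name, hexadecimal in encoded.items():
    print(name, hexadecimal)
```

The same character produces 82a0 in CP932, a4a2 in EUC-JP and e38182 in UTF-8, so bytes written in one charset cannot be read correctly in another without conversion.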