What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????O^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4f5e
SJIS-WIN ?????????????????????O^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4f5e
EUC-JP ?????????????????????O^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4f5e
UTF-8 챈천혝챔쨈혝챠횪혡챔짭혻챦쩐혝챦쩐혘챈짹혨O^ 1110110010110001100010001110110010110010100111001110110110011000100111011110110010110001100101001110110010101000100010001110110110011000100111011110110010110001101000001110110110011010101010101110110110011000101000011110110010110001100101001110110010100111101011011110110110011000101110111110110010110001101001101110110010101001100100001110110110011000100111011110110010110001101001101110110010101001100100001110110110011000100110001110110010110001100010001110110010100111101110011110110110011000101010000100111101011110 ecb188ecb29ced989decb194eca888ed989decb1a0ed9aaaed98a1ecb194eca7aded98bbecb1a6eca990ed989decb1a6eca990ed9898ecb188eca7b9ed98a84f5e
UHC 챈천혝챔쨈혝챠횪혡챔짭혻챦쩐혝챦쩐혘챈짹혨O^ 1100001110100110110000111011010111000010100001111100001110101000110000101011010011000010100001111100001110101101110000111010000011000010100010101100001110101000110000101010110011000010101000001100001110101111110000101011111011000010100001111100001110101111110000101011111011000010100000111100001110100110110000101011000111000010100100000100111101011110 c3a6c3b5c287c3a8c2b4c287c3adc3a0c28ac3a8c2acc2a0c3afc2bec287c3afc2bec283c3a6c2b1c2904f5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
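
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the function name `encode_table` is hypothetical, the codec names are Python's own (`cp932` standing in for SJIS-Win and `cp949` for UHC, both assumptions), and replacing unencodable characters with `?` (byte `3f`) is an assumption that is merely consistent with the `3f` runs in the table above.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset.

    Characters a charset cannot represent are replaced with '?' (0x3f).
    This replacement policy is an assumption, not confirmed behavior of the page.
    """
    rows = []
    for name in charsets:
        data = text.encode(name, errors="replace")   # bytes in the target charset
        bits = "".join(f"{b:08b}" for b in data)     # 8 bits per byte, zero-padded
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    # Hypothetical sample input: one hiragana letter followed by two ASCII characters.
    for name, bits, hex_string in encode_table("あO^"):
        print(name, bits, hex_string)
```

With this sketch, the ASCII characters `O^` come out as `4f5e` in every charset, while `あ` becomes `3f` in ISO-8859-1, which has no code point for it.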