To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???C??????m}???C??????m{^ 00111111001111110011111101000011001111110011111100111111001111110011111100111111011011010111110100111111001111110011111101000011001111110011111100111111001111110011111100111111011011010111101101011110 3f3f3f433f3f3f3f3f3f6d7d3f3f3f433f3f3f3f3f3f6d7b5e
SJIS-WIN ???C??????m}???C??????m{^ 00111111001111110011111101000011001111110011111100111111001111110011111100111111011011010111110100111111001111110011111101000011001111110011111100111111001111110011111100111111011011010111101101011110 3f3f3f433f3f3f3f3f3f6d7d3f3f3f433f3f3f3f3f3f6d7b5e
EUC-JP ???C??????m}???C??????m{^ 00111111001111110011111101000011001111110011111100111111001111110011111100111111011011010111110100111111001111110011111101000011001111110011111100111111001111110011111100111111011011010111101101011110 3f3f3f433f3f3f3f3f3f6d7d3f3f3f433f3f3f3f3f3f6d7b5e
UTF-8 챌짬짱C채쨍쨔챔쩐쩔m}챌짬짱C채쨍쨔챔쩐쩔m{^ 11101100101100011000110011101100101001111010110011101100101001111011000101000011111011001011000110000100111011001010100010001101111011001010100010010100111011001011000110010100111011001010100110010000111011001010100110010100011011010111110111101100101100011000110011101100101001111010110011101100101001111011000101000011111011001011000110000100111011001010100010001101111011001010100010010100111011001011000110010100111011001010100110010000111011001010100110010100011011010111101101011110 ecb18ceca7aceca7b143ecb184eca88deca894ecb194eca990eca9946d7decb18ceca7aceca7b143ecb184eca88deca894ecb194eca990eca9946d7b5e
UHC 챌짬짱C채쨍쨔챔쩐쩔m}챌짬짱C채쨍쨔챔쩐쩔m{^ 11000011101001111100001010101011110000101010111101000011110000111010010011000010101110001100001010111001110000111010100011000010101111101100001010111111011011010111110111000011101001111100001010101011110000101010111101000011110000111010010011000010101110001100001010111001110000111010100011000010101111101100001010111111011011010111101101011110 c3a7c2abc2af43c3a4c2b8c2b9c3a8c2bec2bf6d7dc3a7c2abc2af43c3a4c2b8c2b9c3a8c2bec2bf6d7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)