To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????c??????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110001100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f633f3f3f3f3f3f3f
SJIS-WIN ???????????釗??c??????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111111110111011101100111111001111110110001100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3ffbbb3f3f633f3f3f3f3f3f3f
EUC-JP ???????????釗??c???嫄??? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111100011111110001110100110001111110011111101100011001111110011111100111111100011111011101010100001001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f8fe3a63f3f633f3f3f8fbaa13f3f3f
UTF-8 溜삘뵗溜췊溜삠뀛溜띾졎釗숇젒c溜삠뀛嫄겸뵗溜 11101111101001111000101111101100100000101001100011101011101101011001011111101111101001111000101111101100101101111000101011101111101001111000101111101100100000101010000011101011100000001001101111101111101001111000101111101011100111011011111011101100101000011000111011101001100001111001011111101100100010001000011111101100101000001001001001100011111011111010011110001011111011001000001010100000111010111000000010011011111001011010101110000100111010101011001010111000111010111011010110010111111011111010011110001011 efa78bec8298ebb597efa78becb78aefa78bec82a0eb809befa78beb9dbeeca18ee98797ec8887eca09263efa78bec82a0eb809be5ab84eab2b8ebb597efa78b
UHC 溜삘뵗溜췊溜삠뀛溜띾졎釗숇젒c溜삠뀛嫄겸뵗溜 11101010111111101011101111100010100101001001100111101010111111101010111001000101111010101111111010111011111000111000010110010100111010101111111010001101111010111010000010111011111000011111001010011001111010111010000010010001011000111110101011111110101110111110001110000101100101001110101010110001101100001110001010010100100110011110101011111110 eafebbe29499eafeae45eafebbe38594eafe8deba0bbe1f299eba09163eafebbe38594eab1b0e29499eafe

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
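
The conversion shown in the table can be approximated with a short Python sketch. It is not the converter's own implementation, which is not shown here. The codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) and the helper names `encode_row` and `encode_table` are assumptions chosen for illustration. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the 3f bytes in the table.

```python
# Hypothetical mapping from the labels in the table to Python codec names.
# These pairings are assumptions; the original tool may use other tables.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("c").items():
        print(label, binary, hexadecimal)
```

For the ASCII letter `c`, every charset listed yields the single byte 63 (binary 01100011), which is why that byte appears unchanged in each row of the table.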