What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??縡?衣??經舞醍??縡?衣??經舞怨倣 0011111100111111111000110111000100111111100010001101111100111111001111111110001101010011100101011001000110010001111001110011111100111111111000110111000100111111100010001101111100111111001111111110001101010011100101011001000110001001100001011001010111101101 3f3fe3713f88df3f3fe353959191e73f3fe3713f88df3f3fe3539591898595ed
EUC-JP ??縡?衣??經舞醍??縡?衣??經舞怨倣 0011111100111111111001011101001000111111101100001110000100111111001111111110010110110100110010011111000111000010111010010011111100111111111001011101001000111111101100001110000100111111001111111110010110110100110010011111000110110001111001011100101011101111 3f3fe5d23fb0e13f3fe5b4c9f1c2e93f3fe5d23fb0e13f3fe5b4c9f1b1e5caef
UTF-8 欌렪縡렕衣쭸렮經舞醍닻떵縡렕衣쭸렮經舞怨倣 111001101010110010001100111010111010000010101010111001111011100010100001111010111010000010010101111010001010000110100011111011001010110110111000111010111010000010101110111001111011011010010011111010001000100010011110111010011000011010001101111010111000101110111011111010111001011010110101111001111011100010100001111010111010000010010101111010001010000110100011111011001010110110111000111010111010000010101110111001111011011010010011111010001000100010011110111001101000000010101000111001011000000010100011 e6ac8ceba0aae7b8a1eba095e8a1a3ecadb8eba0aee7b693e8889ee9868deb8bbbeb96b5e7b8a1eba095e8a1a3ecadb8eba0aee7b693e8889ee680a8e580a3
UHC 欌렪縡렕衣쭸렮經舞醍닻떵縡렕衣쭸렮經舞怨倣 111011011110101110001110101110001110111010101101100011101010101011101011111111011100001011100110100011101011101111001100111010001101100111110001111100001011010110110100111010011011011010111010111011101010110110001110101010101110101111111101110000101110011010001110101110111100110011101000110110011111000111101010101100111101101110100111 edeb8eb8eead8eaaebfdc2e68ebbcce8d9f1f0b5b4e9b6baeead8eaaebfdc2e68ebbcce8d9f1eab3dba7
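
The conversion in the table can be roughly reproduced offline. The following is a minimal sketch in Python, not the implementation behind this page. The mapping from charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) and the helper name `encode_all` are assumptions made for illustration. Characters a charset cannot represent are replaced with `?` (byte 0x3f), which matches the `3f` bytes visible in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_all(text):
    """Return {label: (binary string, hex string)} for each charset.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    rows = {}
    for label, codec in CODECS.items():
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_all("衣").items():
        print(label, bits, hexstr)
```

Individual codecs may differ from the page's converter for some vendor-specific characters, so results for unusual input should be treated as approximate.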

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).