What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???旭?埃???????央佃よ佃ュ佃???? 00111111001111110011111110001000101011100011111110011010101110100011111100111111001111110011111100111111001111110011111110001001100110111001001011001111100000101110011010010010110011111000001110000101100100101100111100111111001111110011111100111111 3f3f3f88ae3f9aba3f3f3f3f3f3f3f899b92cf82e692cf838592cf3f3f3f3f
EUC-JP ???旭?埃???????央佃よ佃ュ佃???? 00111111001111110011111110110000101100000011111111010100101111000011111100111111001111110011111100111111001111110011111110110001111110111100010011010001101001001110100011000100110100011010010111100101110001001101000100111111001111110011111100111111 3f3f3fb0b03fd4bc3f3f3f3f3f3f3fb1fbc4d1a4e8c4d1a5e5c4d13f3f3f3f
UTF-8 쒀烈렕旭렖埃렖렣쒔렕렊롛뤎央佃よ佃ュ佃쳩늅춲첁 111011001001001010000000111011111010011010011111111010111010000010010101111001101001011110101101111010111010000010010110111001011001111110000011111010111010000010010110111010111010000010100011111011001001001010010100111010111010000010010101111010111010000010001010111010111010000110011011111010111010010010001110111001011010010010101110111001001011110110000011111000111000001010001000111001001011110110000011111000111000001110100101111001001011110110000011111011001011001110101001111010111000101010000101111011001011011010110010111011001011001010000001 ec9280efa69feba095e697adeba096e59f83eba096eba0a3ec9294eba095eba08aeba19beba48ee5a4aee4bd83e38288e4bd83e383a5e4bd83ecb3a9eb8a85ecb6b2ecb281
UHC 쒀烈렕旭렖埃렖렣쒔렕렊롛뤎央佃よ佃ュ佃쳩늅춲첁 10111110101011001110011011101111100011101010101011101001111011111000111010101011111001001110111110001110101010111000111010110100101111101010110110001110101010101000111010100001100011101101111110001111101111101110010011100111111011101110110010101010111010001110111011101100101010111110010111101110111011001010101110001110101101001011111010101101100011101010101010001110 beace6ef8eaae9ef8eabe4ef8eab8eb4bead8eaa8ea18edf8fbee4e7eeecaae8eeecabe5eeecab8eb4bead8eaa8e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
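
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The names `CHARSETS`, `bit_strings`, and `convert` are hypothetical, and the mapping of the table's labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. It also assumes that a character with no representation in a charset is replaced by "?" (0x3f), which matches the runs of 3f in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Windows code page 932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Unified Hangul Code = code page 949
}


def bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (byte 0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Build one (binary, hex) pair per charset label."""
    return {label: bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("旭").items():
        print(label, binary, hexadecimal)
```

For the single character 旭, this yields 88ae for SJIS-WIN, b0b0 for EUC-JP, and e697ad for UTF-8, the same byte sequences that appear for that character in the rows above, while ISO-8859-1 falls back to 3f.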