What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 短贈息竪損存炭短遜奪造形短贈息竪損存炭短遜奪造形^ 10010010010110101001000110100001100100011010011110010010010001111001000110111001100100011011011010010010010110011001001001011010100100011011101110010010010001001001000110100010100011000110000010010010010110101001000110100001100100011010011110010010010001111001000110111001100100011011011010010010010110011001001001011010100100011011101110010010010001001001000110100010100011000110000001011110 925a91a191a7924791b991b69259925a91bb924491a28c60925a91a191a7924791b991b69259925a91bb924491a28c605e
EUC-JP 短贈息竪損存炭短遜奪造形短贈息竪損存炭短遜奪造形^ 11000011101110111100001010100011110000101010100111000011101010001100001010111011110000101011100011000011101110101100001110111011110000101011110111000011101001011100001010100100101101111100000111000011101110111100001010100011110000101010100111000011101010001100001010111011110000101011100011000011101110101100001110111011110000101011110111000011101001011100001010100100101101111100000101011110 c3bbc2a3c2a9c3a8c2bbc2b8c3bac3bbc2bdc3a5c2a4b7c1c3bbc2a3c2a9c3a8c2bbc2b8c3bac3bbc2bdc3a5c2a4b7c15e
UTF-8 短贈息竪損存炭短遜奪造形短贈息竪損存炭短遜奪造形^ 11100111100111111010110111101000101101001000100011100110100000011010111111100111101010111010101011100110100100001000110111100101101011011001100011100111100000101010110111100111100111111010110111101001100000011001110011100101101001011010101011101001100000001010000011100101101111011010001011100111100111111010110111101000101101001000100011100110100000011010111111100111101010111010101011100110100100001000110111100101101011011001100011100111100000101010110111100111100111111010110111101001100000011001110011100101101001011010101011101001100000001010000011100101101111011010001001011110 e79fade8b488e681afe7abaae6908de5ad98e782ade79fade9819ce5a5aae980a0e5bda2e79fade8b488e681afe7abaae6908de5ad98e782ade79fade9819ce5a5aae980a0e5bda25e
UHC 短贈息竪損存炭短遜奪造形短贈息竪損存炭短遜奪造形^ 11010011101011011111000111111100111000111101001111100010101101011110000111011111111100001110110111110111101010011101001110101101111000011110000111110111101011001111000011100011111110111010000111010011101011011111000111111100111000111101001111100010101101011110000111011111111100001110110111110111101010011101001110101101111000011110000111110111101011001111000011100011111110111010000101011110 d3adf1fce3d3e2b5e1dff0edf7a9d3ade1e1f7acf0e3fba1d3adf1fce3d3e2b5e1dff0edf7a9d3ade1e1f7acf0e3fba15e
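
The rows above can be approximated outside the page with a minimal Python sketch. The function name `encode_row` and the mapping from the table's charset labels to Python codec names are assumptions (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and `errors="replace"` is assumed to mirror the `?` (0x3f) bytes shown in the ISO-8859-1 row for characters that charset cannot represent. This is not necessarily how the page itself performs the conversion.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "短^"  # first and last characters of the input shown in the table
    for label, codec in CODECS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, sample, bits, hexa)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.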

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).