What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?????宥??域??悠??擬???k?油 1110000110011111001111110011111100111111001111110011111110010111010001110011111100111111100010001110011000111111001111111001011101001001001111110011111110001011010110110011111100111111001111111000001010001011001111111001011011111011 e19f3f3f3f3f3f97473f3f88e63f3f97493f3f8b5b3f3f3f828b3f96fb
EUC-JP 癲?????宥??域??悠??擬???k?油 1110001010100001001111110011111100111111001111110011111111001101101010000011111100111111101100001110100000111111001111111100110110101010001111110011111110110101101111000011111100111111001111111010001111101011001111111100110011111101 e2a13f3f3f3f3fcda83f3fb0e83f3fcdaa3f3fb5bc3f3f3fa3eb3fccfd
UTF-8 癲앷쑨鱗득룚宥뱁닞域밟뫁悠⒴슫擬꾨뙔力k맕油 111001111001100110110010111011001001010110110111111011001001000110101000111011111010011110110010111010111001001110011101111010111010001110011010111001011010111010100101111010111011000110000001111010111000101110011110111001011001111110011111111010111011000010011111111010111010101110000001111001101000001010100000111000101001001010110100111011001000101010101011111001101001001110101100111010101011111010101000111010111001100110010100111011111010011010001010111011111011110110001011111010111010011110010101111001101011001010111001 e799b2ec95b7ec91a8efa7b2eb939deba39ae5aea5ebb181eb8b9ee59f9febb09febab81e682a0e292b4ec8aabe693aceabea8eb9994efa68aefbd8beba795e6b2b9
UHC 癲앷쑨鱗득룚宥뱁닞域밟뫁悠⒴슫擬꾨뙔力k맕油 1110111110100110100111011110101010111110101001111110110011100111101101011110011010001111100101101110101011101001101110011110110110001000100111101110011010110100101110011110001010010001101001011110101011101101101010011110010110011010101101001110101111110100100001001110101110001100100110011110011010110011101000111110101110010000101001111110101011111010 efa69deabea7ece7b5e68f96eae9b9ed889ee6b4b9e291a5eaeda9e59ab4ebf484eb8c99e6b3a3eb90a7eafa

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
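
The conversion shown in the table can be approximated offline with a short script. The following is a minimal sketch, not the page's actual implementation: the function name `encode_table` and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the ISO-8859-1 row above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note states
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" (0x3f) instead of raising.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


for label, bits, hex_string in encode_table("油"):
    print(label, bits, hex_string)
```

For the single character 油, this sketch yields 96fb for SJIS-WIN, ccfd for EUC-JP, and e6b2b9 for UTF-8, matching the final bytes of the corresponding rows in the table.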