To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??乳?.轅??碍??悠??釉??甕 111000011001111100111111001111111001001111111011001111111000000101000100111001110111011000111111001111111000101001010110001111110011111110010111010010010011111100111111111001111101011000111111001111111110000101010000 e19f3f3f93fb3f8144e7763f3f8a563f3f97493f3fe7d63f3fe150
EUC-JP 癲??乳?.轅??碍??悠??釉??甕 111000101010000100111111001111111100011011111101001111111010000110100101111011011101011100111111001111111011001110110111001111110011111111001101101010100011111100111111111011101101100000111111001111111110000110110001 e2a13f3fc6fd3fa1a5edd73f3fb3b73f3fcdaa3f3feed83f3fe1b1
UTF-8 癲숈슜乳꿴.轅⑹벑碍⑹룇悠뉒븦釉먮쑞甕 111001111001100110110010111011001000100010001000111011001000101010011100111001001011100110110011111010101011111110110100111011111011110010001110111010001011110110000101111000101001000110111001111010111011001010010001111001111010001010001101111000101001000110111001111010111010001110000111111001101000001010100000111010111000100110010010111010111011100010100110111010011000011110001001111010111010100010101110111011001001000110011110111001111001010010010101 e799b2ec8888ec8a9ce4b9b3eabfb4efbc8ee8bd85e291b9ebb291e7a28de291b9eba387e682a0eb8992ebb8a6e98789eba8aeec919ee79495
UHC 癲숈슜乳꿴.轅⑹벑碍⑹룇悠뉒븦釉먮쑞甕 1110111110100110100110011110110010011010101010011110101011100001101100101110100110100011101011101110101010111111101010011110110010010011101100011110010011110100101010011110110010001111100001101110101011101101100001111110011110010101100011111110101110111000100100001110101110011100101111011110100010111000 efa699ec9aa9eae1b2e9a3aeeabfa9ec93b1e4f4a9ec8f86eaed87e7958febb890eb9cbde8b8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)