What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??違?ⅱ應??裔??楢??揄ъ?亦??溢? 111000011001111100111111001111111000100011100001001111111111101001000001100111001110010000111111001111111110010111100001001111110011111110010011111010000011111100111111100111011000100110000100100011000011111110010110100100100011111100111111100010001110110000111111 e19f3f3f88e13ffa419ce43f3fe5e13f3f93e83f3f9d89848c3f96923f3f88ec3f
EUC-JP 癲??違??應??裔??楢??揄ъ?亦??溢? 1110001010100001001111110011111110110000111000110011111100111111110110001110011000111111001111111110101011100011001111110011111111000110111010100011111100111111110110011110100110100111111011000011111111001011111100100011111100111111101100001110111000111111 e2a13f3fb0e33f3fd8e63f3feae33f3fc6ea3f3fd9e9a7ec3fcbf23f3fb0ee3f
UTF-8 癲앷풝違욘ⅱ應쇱쾻裔꾧쑈楢얏젔揄ъ럷亦껋쉸溢숥 1110011110011001101100101110110010010101101101111110110110010010100111011110100110000001100101011110110010011010100110001110001010000101101100011110011010000111100010011110110010000111101100011110110010111110101110111110100010100011100101001110101010111110101001111110110010010001100010001110011010100101101000101110110010010110100011111110110010100000100101001110011010001111100001001101000110001010111010111001111110110111111001001011101010100110111010101011101110001011111011001000100110111000111001101011101010100010111011001000100010100101 e799b2ec95b7ed929de98195ec9a98e285b1e68789ec87b1ecbebbe8a394eabea7ec9188e6a5a2ec968feca094e68f84d18aeb9fb7e4baa6eabb8bec89b8e6baa2ec88a5
UHC 癲앷풝違욘ⅱ應쇱쾻裔꾧쑈楢얏젔揄ъ럷亦껋쉸溢숥 11101111101001101001110111101010101111101010000011101010110111101011111111100110101001011010001011101011111010111011110011101100101100101001000111100111111000001000010011101010101111101010010011101010111110011011111011100110101000001001001011101010111100011010110011101100100011101001011011100110101100101000001111101100100110101000111011101100111011101001101001000010 efa69deabea0eadebfe6a5a2ebebbcecb291e7e084eabea4eaf9bee6a092eaf1acec8e96e6b283ec9a8eecee9a42

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
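
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_to_bits` and the codec names in `CHARSETS` are assumptions chosen for illustration (Python's `cp932` codec stands in for SJIS-Win and `cp949` for UHC). Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of `3f` in the ISO-8859-1 row above.

```python
from typing import Tuple

# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text: str, codec: str) -> Tuple[str, str]:
    """Return (binary string, hexadecimal string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters
    # instead of raising UnicodeEncodeError.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return bits, data.hex()


if __name__ == "__main__":
    sample = "\u7672"  # the first character of the UTF-8 row in the table
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_to_bits(sample, codec)
        # Only ASCII is printed, so the output is safe on any terminal encoding.
        print(label, bits, hex_string)
```

For U+7672 the UTF-8 result is `e799b2`, matching the first three bytes of the UTF-8 row. Results for the other charsets depend on the codec tables shipped with the Python build and may differ in edge cases from what this page produces.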