To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 永??寃??壤??泣?永??寃???茵 1000100101101001001111110011111110011011100000110011111100111111100110101101111100111111001111111000101110000011001111111000100101101001001111110011111110011011100000110011111100111111001111111110010010011111 89693f3f9b833f3f9adf3f3f8b833f89693f3f9b833f3f3fe49f
EUC-JP 永??寃??壤??泣?永??寃??琯茵 10110001110010100011111100111111110101011110001100111111001111111101010011100001001111110011111110110101111000110011111110110001110010100011111100111111110101011110001100111111001111111000111111001100101100111110100010100001 b1ca3f3fd5e33f3fd4e13f3fb5e33fb1ca3f3fd5e33f3f8fccb3e8a1
UTF-8 永띔퇊寃쏁걣壤깆쥜泣쩈永띔퇊寃쎾푻琯茵 111001101011000010111000111010111001110110010100111011011000011110001010111001011010111110000011111011001000111110000001111010101011000110100011111001011010001110100100111010101011100110000110111011001010010110011100111001101011001110100011111011001010100110001000111001101011000010111000111010111001110110010100111011011000011110001010111001011010111110000011111011001000111010111110111011011001000110111011111001111001000010101111111010001000110010110101 e6b0b8eb9d94ed878ae5af83ec8f81eab1a3e5a3a4eab986eca59ce6b3a3eca988e6b0b8eb9d94ed878ae5af83ec8ebeed91bbe790afe88cb5
UHC 永띔퇊寃쏁걣壤깆쥜泣쩈永띔퇊寃쎾푻琯茵 1110011110110101101101101110101010110111100110111110101010110010100110111110011110000001100011001110010110111101101100011110110010100010100100011110101111101000101001010100001011100111101101011011011011101010101101111001101111101010101100101001101111100101101111101000011111001110101101011110110011100000 e7b5b6eab79beab29be7818ce5bdb1eca291ebe8a542e7b5b6eab79beab29be5be87ceb5ece0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)