To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 亦??????с?夜?????循??畑?? 1001011010010010001111110011111100111111001111110011111100111111100001001000001100111111100101101110100100111111001111110011111100111111001111111000111101111010001111110011111110010100101010000011111100111111 96923f3f3f3f3f3f84833f96e93f3f3f3f3f8f7a3f3f94a83f3f
EUC-JP 亦??嫄???с?夜?????循??畑?? 11001011111100100011111100111111100011111011101010100001001111110011111100111111101001111110001100111111110011001110101100111111001111110011111100111111001111111011110111011011001111110011111111001000101010100011111100111111 cbf23f3f8fbaa13f3f3fa7e33fcceb3f3f3f3f3fbddb3f3fc8aa3f3f
UTF-8 亦껉퀓嫄띌쨹戮с걧夜껋뮂杻ⓨ죰循녿옒畑듭뼰 1110010010111010101001101110101010111011100010011110110110000000100100111110010110101011100001001110101110011101100011001110110010101000101110011110111110100111100100101101000110000001111010101011000110100111111001011010010010011100111010101011101110001011111010111010111010000010111011111010011110001000111000101001001110101000111011001010001110110000111001011011111010101010111010111000010110111111111011001001100010010010111001111001010110010001111010111001001110101101111010111011110010110000 e4baa6eabb89ed8093e5ab84eb9d8ceca8b9efa792d181eab1a7e5a49ceabb8bebae82efa788e293a8eca3b0e5beaaeb85bfec9892e79591eb93adebbcb0
UHC 亦껉퀓嫄띌쨹戮с걧夜껋뮂杻ⓨ죰循녿옒畑듭뼰 111001101011001010000011111010101011001110001000111010101011000110110110111010011010010010010011111010111011110110101100111000111000000110010000111001011010100010000011111011001001001010010001111010101111010010101000111001011010000110001011111000101110000010000110111010111001111010011000111011111010010110110101111011001001011010110011 e6b283eab388eab1b6e9a493ebbdace38190e5a883ec9291eaf4a8e5a18be2e086eb9e98efa5b5ec96b3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)