What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 耶????????v耶????????vB 1001011011101011001111110011111100111111001111110011111100111111001111110011111101110110100101101110101100111111001111110011111100111111001111110011111100111111001111110111011001000010 96eb3f3f3f3f3f3f3f3f7696eb3f3f3f3f3f3f3f3f7642
EUC-JP 耶??庾?????v耶??庾?????vB 110011001110110100111111001111111000111110111100110011100011111100111111001111110011111100111111011101101100110011101101001111110011111110001111101111001100111000111111001111110011111100111111001111110111011001000010 cced3f3f8fbcce3f3f3f3f3f76cced3f3f8fbcce3f3f3f3f3f7642
UTF-8 耶쇨퍌庾당넭戮⑸쐲v耶쇨퍌庾당넭戮⑸쐲vB 111010001000000010110110111011001000011110101000111011011000110110001100111001011011101010111110111010111000101110111001111010111000010010101101111011111010011110010010111000101001000110111000111011001001000010110010011101101110100010000000101101101110110010000111101010001110110110001101100011001110010110111010101111101110101110001011101110011110101110000100101011011110111110100111100100101110001010010001101110001110110010010000101100100111011001000010 e880b6ec87a8ed8d8ce5babeeb8bb9eb84adefa792e291b8ec90b276e880b6ec87a8ed8d8ce5babeeb8bb9eb84adefa792e291b8ec90b27642
UHC 耶쇨퍌庾당넭戮⑸쐲v耶쇨퍌庾당넭戮⑸쐲vB 111001011010110110111100111010101011101110000011111010101110110010110100111001111000011010101100111010111011110110101001111010111001110010010101011101101110010110101101101111001110101010111011100000111110101011101100101101001110011110000110101011001110101110111101101010011110101110011100100101010111011001000010 e5adbceabb83eaecb4e786acebbda9eb9c9576e5adbceabb83eaecb4e786acebbda9eb9c957642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
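
The conversion shown in the table above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names in `CHARSETS` are assumptions about the closest Python equivalents of the listed charsets (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and `encode_row` and `convert` are hypothetical helper names. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    # Render each byte as 8 bits, keeping leading zeros.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexed) in convert("耶vB").items():
        print(label, bits, hexed)
```

Usage: call `convert("耶vB")` and compare the hexadecimal column with the corresponding bytes in the table. Results for other codecs or library versions may differ slightly, since vendor code pages such as CP932 have several variants.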