To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 夜??吟??矣??夜??釉ワ?循??筌 100101101110100100111111001111111000101111100001001111110011111111100001111000010011111100111111100101101110100100111111001111111110011111010110100000111000111100111111100011110111101000111111001111111110001010100011 96e93f3f8be13f3fe1e13f3f96e93f3fe7d6838f3f8f7a3f3fe2a3
EUC-JP 夜??吟??矣??夜??釉ワ?循??筌 110011001110101100111111001111111011011011100011001111110011111111100010111000110011111100111111110011001110101100111111001111111110111011011000101001011110111100111111101111011101101100111111001111111110010010100101 cceb3f3fb6e33f3fe2e33f3fcceb3f3feed8a5ef3fbddb3f3fe4a5
UTF-8 夜쏅뗄吟당몭矣⑹젵夜껊뎿釉ワ쭓循뗫닂筌 111001011010010010011100111011001000111110000101111010111001011110000100111001011001000010011111111010111000101110111001111010111010101010101101111001111001111110100011111000101001000110111001111011001010000010110101111001011010010010011100111010101011101110001010111010111000111010111111111010011000011110001001111000111000001110101111111011001010110110010011111001011011111010101010111010111001011110101011111010111000101110000010111001111010110110001100 e5a49cec8f85eb9784e5909feb8bb9ebaaade79fa3e291b9eca0b5e5a49ceabb8aeb8ebfe98789e383afecad93e5beaaeb97abeb8b82e7ad8c
UHC 夜쏅뗄吟당몭矣⑹젵夜껊뎿釉ワ쭓循뗫닂筌 1110010110101000100110111110101110110110101111111110101111100001101101001110011110010001100101111110101111111000101010011110110010100000101010011110010110101000100000111110101110001001100100101110101110111000101010111110111110100111100010111110001011100000100010111110101110001000100010111110111110100111 e5a89bebb6bfebe1b4e79197ebf8a9eca0a9e5a883eb8992ebb8abefa78be2e08beb888befa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)