What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 弔?瀞??狡??畯陌舞弔?衣絲????畯陌舞B 1001001010100010001111111001001111010010001111110011111111100000110000100011111100111111111110110110111111101000100110011001010110010001100100101010001000111111100010001101111111100011010011100011111100111111001111110011111111111011011011111110100010011001100101011001000101000010 92a23f93d23f3fe0c23f3ffb6fe899959192a23f88dfe34e3f3f3f3ffb6fe899959142
EUC-JP 弔?瀞??狡??畯陌舞弔?衣絲????畯陌舞B 11000100101001000011111111000110110101000011111100111111111000001100010000111111001111111000111111001101101110111110111111111001110010011111000111000100101001000011111110110000111000011110010110101111001111110011111100111111001111111000111111001101101110111110111111111001110010011111000101000010 c4a43fc6d43f3fe0c43f3f8fcdbbeff9c9f1c4a43fb0e1e5af3f3f3f3f8fcdbbeff9c9f142
UTF-8 弔렲瀞펜렲狡렕렟畯陌舞弔렲衣絲렢닿렕렟畯陌舞B 11100101101111001001010011101011101000001011001011100111100000001001111011101101100011101001110011101011101000001011001011100111100010111010000111101011101000001001010111101011101000001001111111100111100101011010111111101001100110011000110011101000100010001001111011100101101111001001010011101011101000001011001011101000101000011010001111100111101101011011001011101011101000001010001011101011100010111011111111101011101000001001010111101011101000001001111111100111100101011010111111101001100110011000110011101000100010001001111001000010 e5bc94eba0b2e7809eed8e9ceba0b2e78ba1eba095eba09fe795afe9998ce8889ee5bc94eba0b2e8a1a3e7b5b2eba0a2eb8bbfeba095eba09fe795afe9998ce8889e42
UHC 弔렲瀞펜렲狡렕렟畯陌舞弔렲衣絲렢닿렕렟畯陌舞B 111100001100000010001110101111111110111111100111110001101110011010001110101111111100111011101010100011101010101010001110101100001111000111100001110110001110100011011001111100011111000011000000100011101011111111101011111111011101111011101010100011101011001110110100111010101000111010101010100011101011000011110001111000011101100011101000110110011111000101000010 f0c08ebfefe7c6e68ebfceea8eaa8eb0f1e1d8e8d9f1f0c08ebfebfddeea8eb3b4ea8eaa8eb0f1e1d8e8d9f142

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). In the table above, a character that a charset cannot represent appears as "?" (byte 3f).
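
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_report` is hypothetical, and the codec names are Python's own (`cp932` for SJIS-Win, `cp949` as the assumed equivalent of UHC). Unencodable characters are replaced with "?", which appears to match the 3f bytes in the table, but edge-case mappings may differ from the tool's.

```python
# Sketch: encode a string in several charsets and show binary and hex forms.
# Codec names are Python's; cp949 standing in for UHC is an assumption.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_report(text, charsets=CHARSETS):
    """Return a list of (charset, binary string, hex string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_report("弔B"):
        print(name, bits, hexstr)
```

Calling `encode_report` with a single character makes it easy to compare byte lengths across charsets: "B" is one byte everywhere, while a kanji takes two bytes in CP932 and EUC-JP and three in UTF-8.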