To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN タホチカタ鄲ホチカタ躰タホチカタ鄲ホチカタ躰^ 110000001100111011000001101101101100000011100111110000001100111011000001101101101100000011100111010110111100000011001110110000011011011011000000111001111100000011001110110000011011011011000000111001110101101101011110 c0cec1b6c0e7c0cec1b6c0e75bc0cec1b6c0e7c0cec1b6c0e75b5e
EUC-JP タホチカタ鄲ホチカタ躰タホチカタ鄲ホチカタ躰^ 100011101100000010001110110011101000111011000001100011101011011010001110110000001110111011000010100011101100111010001110110000011000111010110110100011101100000011101101101111001000111011000000100011101100111010001110110000011000111010110110100011101100000011101110110000101000111011001110100011101100000110001110101101101000111011000000111011011011110001011110 8ec08ece8ec18eb68ec0eec28ece8ec18eb68ec0edbc8ec08ece8ec18eb68ec0eec28ece8ec18eb68ec0edbc5e
UTF-8 タホチカタ鄲ホチカタ躰タホチカタ鄲ホチカタ躰^ 11101111101111101000000011101111101111101000111011101111101111101000000111101111101111011011011011101111101111101000000011101001100001001011001011101111101111101000111011101111101111101000000111101111101111011011011011101111101111101000000011101000101110101011000011101111101111101000000011101111101111101000111011101111101111101000000111101111101111011011011011101111101111101000000011101001100001001011001011101111101111101000111011101111101111101000000111101111101111011011011011101111101111101000000011101000101110101011000001011110 efbe80efbe8eefbe81efbdb6efbe80e984b2efbe8eefbe81efbdb6efbe80e8bab0efbe80efbe8eefbe81efbdb6efbe80e984b2efbe8eefbe81efbdb6efbe80e8bab05e
UHC ?????鄲??????????鄲?????^ 00111111001111110011111100111111001111111101001110110011001111110011111100111111001111110011111100111111001111110011111100111111001111111101001110110011001111110011111100111111001111110011111101011110 3f3f3f3f3fd3b33f3f3f3f3f3f3f3f3f3fd3b33f3f3f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)