To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 迢ク螻樊搗迢ク螻樣●迢ク螻樊搗迢ク螻樣●B 11100111100010111011100011100101101100011001111011100110100111011001000111100111100010111011100011100101101100011001111011101001100000011001110011100111100010111011100011100101101100011001111011100110100111011001000111100111100010111011100011100101101100011001111011101001100000011001110001000010 e78bb8e5b19ee69d91e78bb8e5b19ee9819ce78bb8e5b19ee69d91e78bb8e5b19ee9819c42
EUC-JP 迢ク螻樊搗迢ク螻樣●迢ク螻樊搗迢ク螻樣●B 1110110111101011100011101011100011101010101100111101110011101000110110011111000111101101111010111000111010111000111010101011001111011100111010111010000111111100111011011110101110001110101110001110101010110011110111001110100011011001111100011110110111101011100011101011100011101010101100111101110011101011101000011111110001000010 edeb8eb8eab3dce8d9f1edeb8eb8eab3dceba1fcedeb8eb8eab3dce8d9f1edeb8eb8eab3dceba1fc42
UTF-8 迢ク螻樊搗迢ク螻樣●迢ク螻樊搗迢ク螻樣●B 11101000101111111010001011101111101111011011100011101000100111101011101111100110101010001000101011100110100100001001011111101000101111111010001011101111101111011011100011101000100111101011101111100110101010001010001111100010100101111000111111101000101111111010001011101111101111011011100011101000100111101011101111100110101010001000101011100110100100001001011111101000101111111010001011101111101111011011100011101000100111101011101111100110101010001010001111100010100101111000111101000010 e8bfa2efbdb8e89ebbe6a88ae69097e8bfa2efbdb8e89ebbe6a8a3e2978fe8bfa2efbdb8e89ebbe6a88ae69097e8bfa2efbdb8e89ebbe6a8a3e2978f42
UHC ???樊搗???樣●???樊搗???樣●B 0011111100111111001111111101101111100000110100111111110100111111001111110011111111100101110001101010000111011100001111110011111100111111110110111110000011010011111111010011111100111111001111111110010111000110101000011101110001000010 3f3f3fdbe0d3fd3f3f3fe5c6a1dc3f3f3fdbe0d3fd3f3f3fe5c6a1dc42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)