To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????[????????[^ 00111111001111110011111100111111001111110011111100111111001111110101101100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 坦卒淡探坦卒淡探[坦卒淡探坦卒淡探[^ 1001001001010010100100011011001010010010010101111001001001010100100100100101001010010001101100101001001001010111100100100101010001011011100100100101001010010001101100101001001001010111100100100101010010010010010100101001000110110010100100100101011110010010010101000101101101011110 925291b292579254925291b2925792545b925291b292579254925291b2925792545b5e
EUC-JP 坦卒淡探坦卒淡探[坦卒淡探坦卒淡探[^ 1100001110110011110000101011010011000011101110001100001110110101110000111011001111000010101101001100001110111000110000111011010101011011110000111011001111000010101101001100001110111000110000111011010111000011101100111100001010110100110000111011100011000011101101010101101101011110 c3b3c2b4c3b8c3b5c3b3c2b4c3b8c3b55bc3b3c2b4c3b8c3b5c3b3c2b4c3b8c3b55b5e
UTF-8 坦卒淡探坦卒淡探[坦卒淡探坦卒淡探[^ 111001011001110110100110111001011000110110010010111001101011011110100001111001101000111010100010111001011001110110100110111001011000110110010010111001101011011110100001111001101000111010100010010110111110010110011101101001101110010110001101100100101110011010110111101000011110011010001110101000101110010110011101101001101110010110001101100100101110011010110111101000011110011010001110101000100101101101011110 e59da6e58d92e6b7a1e68ea2e59da6e58d92e6b7a1e68ea25be59da6e58d92e6b7a1e68ea2e59da6e58d92e6b7a1e68ea25b5e
UHC 坦卒淡探坦卒淡探[坦卒淡探坦卒淡探[^ 1111011110100100111100001110111111010011101111111111011110101110111101111010010011110000111011111101001110111111111101111010111001011011111101111010010011110000111011111101001110111111111101111010111011110111101001001111000011101111110100111011111111110111101011100101101101011110 f7a4f0efd3bff7aef7a4f0efd3bff7ae5bf7a4f0efd3bff7aef7a4f0efd3bff7ae5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)