To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 炭卒捉炭卒測炭其息炭其捉炭卒属炭其 10010010010110011001000110110010100100011010100010010010010110011001000110110010100100011010101010010010010110011001000110110100100100011010011110010010010110011001000110110100100100011010100010010010010110011001000110110010100100011010111010010010010110011001000110110100 925991b291a8925991b291aa925991b491a7925991b491a8925991b291ae925991b4
EUC-JP 炭卒捉炭卒測炭其息炭其捉炭卒属炭其 11000011101110101100001010110100110000101010101011000011101110101100001010110100110000101010110011000011101110101100001010110110110000101010100111000011101110101100001010110110110000101010101011000011101110101100001010110100110000101011000011000011101110101100001010110110 c3bac2b4c2aac3bac2b4c2acc3bac2b6c2a9c3bac2b6c2aac3bac2b4c2b0c3bac2b6
UTF-8 炭卒捉炭卒測炭其息炭其捉炭卒属炭其 111001111000001010101101111001011000110110010010111001101000110110001001111001111000001010101101111001011000110110010010111001101011100010101100111001111000001010101101111001011000010110110110111001101000000110101111111001111000001010101101111001011000010110110110111001101000110110001001111001111000001010101101111001011000110110010010111001011011000110011110111001111000001010101101111001011000010110110110 e782ade58d92e68d89e782ade58d92e6b8ace782ade585b6e681afe782ade585b6e68d89e782ade58d92e5b19ee782ade585b6
UHC 炭卒捉炭卒測炭其息炭其捉炭卒?炭其 111101111010100111110000111011111111001110110101111101111010100111110000111011111111011010110100111101111010100111010000111011001110001111010011111101111010100111010000111011001111001110110101111101111010100111110000111011110011111111110111101010011101000011101100 f7a9f0eff3b5f7a9f0eff6b4f7a9d0ece3d3f7a9d0ecf3b5f7a9f0ef3ff7a9d0ec

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)