To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????GB 0011111100111111001111110011111100111111001111110011111100111111001111110100011101000010 3f3f3f3f3f3f3f3f3f4742
SJIS-WIN 奪村形棚他贈棚炭促GB 1001001001000100100100011011101010001100011000001001001001001001100100011011110010010001101000011001001001001001100100100101100110010001101000110100011101000010 924491ba8c60924991bc91a19249925991a34742
EUC-JP 奪村形棚他贈棚炭促GB 1100001110100101110000101011110010110111110000011100001110101010110000101011111011000010101000111100001110101010110000111011101011000010101001010100011101000010 c3a5c2bcb7c1c3aac2bec2a3c3aac3bac2a54742
UTF-8 奪村形棚他贈棚炭促GB 1110010110100101101010101110011010011101100100011110010110111101101000101110011010100011100110101110010010111011100101101110100010110100100010001110011010100011100110101110011110000010101011011110010010111111100000110100011101000010 e5a5aae69d91e5bda2e6a39ae4bb96e8b488e6a39ae782ade4bf834742
UHC 奪村形棚他贈棚炭促GB 1111011110101100111101011011110111111011101000011101110111011100111101101110001011110001111111001101110111011100111101111010100111110101101101010100011101000010 f7acf5bdfba1dddcf6e2f1fcdddcf7a9f5b54742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)