What bit string is a character (or a short string of characters) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ???褥э?擁??}???褥э?擁??{^ 001111110011111100111111111001011111000110000100100011110011111110010111011010010011111100111111011111010011111100111111001111111110010111110001100001001000111100111111100101110110100100111111001111110111101101011110 3f3f3fe5f1848f3f97693f3f7d3f3f3fe5f1848f3f97693f3f7b5e
EUC-JP 蔣??褥э?擁??}蔣??褥э?擁??{^ 10001111110110011011011000111111001111111110101011110011101001111110111100111111110011011100101000111111001111110111110110001111110110011011011000111111001111111110101011110011101001111110111100111111110011011100101000111111001111110111101101011110 8fd9b63f3feaf3a7ef3fcdca3f3f7d8fd9b63f3feaf3a7ef3fcdca3f3f7b5e
UTF-8 蔣쏃쎁褥э슈擁븃챸}蔣쏃쎁褥э슈擁븃챸{^ 11101000100101001010001111101100100011111000001111101100100011101000000111101000101001001010010111010001100011011110110010001010100010001110011010010011100000011110101110111000100000111110110010110001101110000111110111101000100101001010001111101100100011111000001111101100100011101000000111101000101001001010010111010001100011011110110010001010100010001110011010010011100000011110101110111000100000111110110010110001101110000111101101011110 e894a3ec8f83ec8e81e8a4a5d18dec8a88e69381ebb883ecb1b87de894a3ec8f83ec8e81e8a4a5d18dec8a88e69381ebb883ecb1b87b5e
UHC 蔣쏃쎁褥э슈擁븃챸}蔣쏃쎁褥э슈擁븃챸{^ 111011011111100010011011111010011001101110101011111010011011001110101100111011111011110110110100111010001011011010111010111010001010101010000101011111011110110111111000100110111110100110011011101010111110100110110011101011001110111110111101101101001110100010110110101110101110100010101010100001010111101101011110 edf89be99babe9b3acefbdb4e8b6bae8aa857dedf89be99babe9b3acefbdb4e8b6bae8aa857b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
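
The same kind of table can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page: the mapping from the charset labels above to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the rows above.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("}{^").items():
        print(label, bits, hex_string)
```

Pure ASCII input such as "}{^" yields the same bytes (7d 7b 5e) in all five charsets, which is why every row above ends in 7b5e. Differences only appear for non-ASCII characters.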