To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 芸?怨捧??障?}v芸?怨捧??障?}vB 1000110001111100001111111000100110000101100101011111100100111111001111111000111111100001001111110111110101110110100011000111110000111111100010011000010110010101111110010011111100111111100011111110000100111111011111010111011001000010 8c7c3f898595f93f3f8fe13f7d768c7c3f898595f93f3f8fe13f7d7642
EUC-JP 芸?怨捧??障?}v芸?怨捧??障?}vB 1011011111011101001111111011000111100101110010101111101100111111001111111011111011100011001111110111110101110110101101111101110100111111101100011110010111001010111110110011111100111111101111101110001100111111011111010111011001000010 b7dd3fb1e5cafb3f3fbee33f7d76b7dd3fb1e5cafb3f3fbee33f7d7642
UTF-8 芸렑怨捧렜렕障렚}v芸렑怨捧렜렕障렚}vB 1110100010001010101110001110101110100000100100011110011010000000101010001110011010001101101001111110101110100000100111001110101110100000100101011110100110011010100111001110101110100000100110100111110101110110111010001000101010111000111010111010000010010001111001101000000010101000111001101000110110100111111010111010000010011100111010111010000010010101111010011001101010011100111010111010000010011010011111010111011001000010 e88ab8eba091e680a8e68da7eba09ceba095e99a9ceba09a7d76e88ab8eba091e680a8e68da7eba09ceba095e99a9ceba09a7d7642
UHC 芸렑怨捧렜렕障렚}v芸렑怨捧렜렕障렚}vB 11101001111111011000111010100110111010101011001111011100111010011000111010101110100011101010101011101110101000011000111010101101011111010111011011101001111111011000111010100110111010101011001111011100111010011000111010101110100011101010101011101110101000011000111010101101011111010111011001000010 e9fd8ea6eab3dce98eae8eaaeea18ead7d76e9fd8ea6eab3dce98eae8eaaeea18ead7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)