To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 蒸???味諸?誼暮蒸???味諸?誼矛E 1000111111110110001111110011111100111111100101101010000110001111100101000011111110001011011000101001010111101001100011111111011000111111001111110011111110010110101000011000111110010100001111111000101101100010100101101011010101000101 8ff63f3f3f96a18f943f8b6295e98ff63f3f3f96a18f943f8b6296b545
EUC-JP 蒸???味諸?誼暮蒸???味諸?誼矛E 1011111011111000001111110011111100111111110011001010001110111101111101000011111110110101110000111100101011101011101111101111100000111111001111110011111111001100101000111011110111110100001111111011010111000011110011001011011101000101 bef83f3f3fcca3bdf43fb5c3caebbef83f3f3fcca3bdf43fb5c3ccb745
UTF-8 蒸븃렓당味諸렪誼暮蒸븃렓당味諸렪誼矛E 11101000100100101011100011101011101110001000001111101011101000001001001111101011100010111011100111100101100100011011001111101000101010111011100011101011101000001010101011101000101010101011110011100110100110101010111011101000100100101011100011101011101110001000001111101011101000001001001111101011100010111011100111100101100100011011001111101000101010111011100011101011101000001010101011101000101010101011110011100111100111111001101101000101 e892b8ebb883eba093eb8bb9e591b3e8abb8eba0aae8aabce69aaee892b8ebb883eba093eb8bb9e591b3e8abb8eba0aae8aabce79f9b45
UHC 蒸븃렓당味諸렪誼暮蒸븃렓당味諸렪誼矛E 11110001111110101011101011101000100011101010100010110100111001111101101010101011111100001011001110001110101110001110101111111110110110011011101011110001111110101011101011101000100011101010100010110100111001111101101010101011111100001011001110001110101110001110101111111110110110011100001101000101 f1fabae88ea8b4e7daabf0b38eb8ebfed9baf1fabae88ea8b4e7daabf0b38eb8ebfed9c345

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)