Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????E 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN ??????領??暮??????領??矛E 00111111001111110011111100111111001111110011111110010111110011000011111100111111100101011110100100111111001111110011111100111111001111110011111110010111110011000011111100111111100101101011010101000101 3f3f3f3f3f3f97cc3f3f95e93f3f3f3f3f3f97cc3f3f96b545
EUC-JP ??????領??暮??????領??矛E 00111111001111110011111100111111001111110011111111001110110011100011111100111111110010101110101100111111001111110011111100111111001111110011111111001110110011100011111100111111110011001011011101000101 3f3f3f3f3f3fcece3f3fcaeb3f3f3f3f3f3fcece3f3fccb745
UTF-8 렻매렎렻렚렻領렰렻暮렻매렎렻렚렻領렰렻矛E 11101011101000001011101111101011101001111010010011101011101000001000111011101011101000001011101111101011101000001001101011101011101000001011101111101001101000001001100011101011101000001011000011101011101000001011101111100110100110101010111011101011101000001011101111101011101001111010010011101011101000001000111011101011101000001011101111101011101000001001101011101011101000001011101111101001101000001001100011101011101000001011000011101011101000001011101111100111100111111001101101000101 eba0bbeba7a4eba08eeba0bbeba09aeba0bbe9a098eba0b0eba0bbe69aaeeba0bbeba7a4eba08eeba0bbeba09aeba0bbe9a098eba0b0eba0bbe79f9b45
UHC 렻매렎렻렚렻領렰렻暮렻매렎렻렚렻領렰렻矛E 1000111011000011101110001100010110001110101001001000111011000011100011101010110110001110110000111101011011000101100011101011110110001110110000111101100110111010100011101100001110111000110001011000111010100100100011101100001110001110101011011000111011000011110101101100010110001110101111011000111011000011110110011100001101000101 8ec3b8c58ea48ec38ead8ec3d6c58ebd8ec3d9ba8ec3b8c58ea48ec38ead8ec3d6c58ebd8ec3d9c345

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
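
A table like the one above can be reproduced offline with a short script. The following is a minimal sketch in Python, not the converter's actual implementation. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, and the function name `encode_all` is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes b"?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_all("領E").items():
        print(label, bits, hexstr)
```

For the input "領E", the hexadecimal column should agree with the corresponding bytes in the table: `97cc` followed by `45` for SJIS-WIN, `cece45` for EUC-JP, and `e9a09845` for UTF-8.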