To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????E 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN ???????暮???????矛E 00111111001111110011111100111111001111110011111100111111100101011110100100111111001111110011111100111111001111110011111100111111100101101011010101000101 3f3f3f3f3f3f3f95e93f3f3f3f3f3f3f96b545
EUC-JP ???????暮???????矛E 00111111001111110011111100111111001111110011111100111111110010101110101100111111001111110011111100111111001111110011111100111111110011001011011101000101 3f3f3f3f3f3f3fcaeb3f3f3f3f3f3f3fccb745
UTF-8 렻렓렺렪렻렎렻暮렻렓렺렪렻렎렻矛E 11101011101000001011101111101011101000001001001111101011101000001011101011101011101000001010101011101011101000001011101111101011101000001000111011101011101000001011101111100110100110101010111011101011101000001011101111101011101000001001001111101011101000001011101011101011101000001010101011101011101000001011101111101011101000001000111011101011101000001011101111100111100111111001101101000101 eba0bbeba093eba0baeba0aaeba0bbeba08eeba0bbe69aaeeba0bbeba093eba0baeba0aaeba0bbeba08eeba0bbe79f9b45
UHC 렻렓렺렪렻렎렻暮렻렓렺렪렻렎렻矛E 100011101100001110001110101010001000111011000010100011101011100010001110110000111000111010100100100011101100001111011001101110101000111011000011100011101010100010001110110000101000111010111000100011101100001110001110101001001000111011000011110110011100001101000101 8ec38ea88ec28eb88ec38ea48ec3d9ba8ec38ea88ec28eb88ec38ea48ec3d9c345

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)