To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????管悠??碎λ???????逸?B 001111110011111100111111001111110011111100111111100010101100011110010111010010010011111100111111111000011110101010000011110010010011111100111111001111110011111100111111001111110011111110001000111011010011111101000010 3f3f3f3f3f3f8ac797493f3fe1ea83c93f3f3f3f3f3f3f88ed3f42
EUC-JP ???佾??管悠??碎λ???????逸?B 0011111100111111001111111000111110110000111110110011111100111111101101001100100111001101101010100011111100111111111000101110110010100110110010110011111100111111001111110011111100111111001111110011111110110000111011110011111101000010 3f3f3f8fb0fb3f3fb4c9cdaa3f3fe2eca6cb3f3f3f3f3f3f3fb0ef3f42
UTF-8 麗몃쓷佾쒏룚管悠끾뉩碎λ뮪麗몃쓷流섇쳞逸쌬B 111011111010011010001000111010111010101010000011111011001001001110110111111001001011110110111110111011001001001010001111111010111010001110011010111001111010111010100001111001101000001010100000111010111000000110111110111010111000100110101001111001111010001010001110110011101011101111101011101011101010101011101111101001101000100011101011101010101000001111101100100100111011011111101111101001111000101011101100100001001000011111101100101100111001111011101001100000001011100011101100100011001010110001000010 efa688ebaa83ec93b7e4bdbeec928feba39ae7aea1e682a0eb81beeb89a9e7a28ecebbebaeaaefa688ebaa83ec93b7efa78aec8487ecb39ee980b8ec8cac42
UHC 麗몃쓷佾쒏룚管悠끾뉩碎λ뮪麗몃쓷流섇쳞逸쌬B 11100110101100001011100011101011100111011001010011101100111010111001110011100110100011111001011011001110101101111110101011101101100001011110011010110100101110011110000111101111101001011110101110010010101101001110011010110000101110001110101110011101100101001110101011111100100110001110010110101011100001001110110011101111100110110101010001000010 e6b0b8eb9d94eceb9ce68f96ceb7eaed85e6b4b9e1efa5eb92b4e6b0b8eb9d94eafc98e5ab84ecef9b5442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)