To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 蒻り?諭e?袁λ?[蒻り?諭e?袁λ?[^ 111001001110100010000010111010000011111110010111010000001000001010000101001111111110010111001101100000111100100100111111010110111110010011101000100000101110100000111111100101110100000010000010100001010011111111100101110011011000001111001001001111110101101101011110 e4e882e83f974082853fe5cd83c93f5be4e882e83f974082853fe5cd83c93f5b5e
EUC-JP 蒻り?諭e?袁λ?[蒻り?諭e?袁λ?[^ 111010001110101010100100111010100011111111001101101000011010001111100101001111111110101011001111101001101100101100111111010110111110100011101010101001001110101000111111110011011010000110100011111001010011111111101010110011111010011011001011001111110101101101011110 e8eaa4ea3fcda1a3e53feacfa6cb3f5be8eaa4ea3fcda1a3e53feacfa6cb3f5b5e
UTF-8 蒻り쑤諭e땔袁λ괏[蒻り쑤諭e땔袁λ괏[^ 11101000100100101011101111100011100000101000101011101100100100011010010011101000101010111010110111101111101111011000010111101011100101011001010011101000101000101000000111001110101110111110101010110100100011110101101111101000100100101011101111100011100000101000101011101100100100011010010011101000101010111010110111101111101111011000010111101011100101011001010011101000101000101000000111001110101110111110101010110100100011110101101101011110 e892bbe3828aec91a4e8abadefbd85eb9594e8a281cebbeab48f5be892bbe3828aec91a4e8abadefbd85eb9594e8a281cebbeab48f5b5e
UHC 蒻り쑤諭e땔袁λ괏[蒻り쑤諭e땔袁λ괏[^ 111001011011011010101010111010101011111010100101111010111011000110100011111001011011011010101010111010101011111010100101111010111011000110100011010110111110010110110110101010101110101010111110101001011110101110110001101000111110010110110110101010101110101010111110101001011110101110110001101000110101101101011110 e5b6aaeabea5ebb1a3e5b6aaeabea5ebb1a35be5b6aaeabea5ebb1a3e5b6aaeabea5ebb1a35b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
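
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this tool's actual implementation. The function names `encode_row` and `encode_table` are hypothetical, and the mapping from the labels above to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption that may differ from the tool in edge cases. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to be what the table does.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Unified Hangul Code = code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) of text encoded with codec.

    Unencodable characters become "?" (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in encode_table("り").items():
        print(label, binary, hexa)
```

For the single character "り", the hexadecimal values agree with the corresponding bytes in the table above: 82e8 in SJIS-WIN, a4ea in EUC-JP, and e3828a in UTF-8.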