To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 汚??遙??巍??[汚??遙??巍??[^ 100010011001100000111111001111111110101010100001001111110011111110011011110110010011111100111111010110111000100110011000001111110011111111101010101000010011111100111111100110111101100100111111001111110101101101011110 89983f3feaa13f3f9bd93f3f5b89983f3feaa13f3f9bd93f3f5b5e
EUC-JP 汚??遙??巍??[汚??遙??巍??[^ 101100011111100000111111001111111111010010100011001111110011111111010110110110110011111100111111010110111011000111111000001111110011111111110100101000110011111100111111110101101101101100111111001111110101101101011110 b1f83f3ff4a33f3fd6db3f3f5bb1f83f3ff4a33f3fd6db3f3f5b5e
UTF-8 汚좈냽遙볢윝巍띺쳯[汚좈냽遙볢윝巍띺쳯[^ 111001101011000110011010111011001010001010001000111010111000001110111101111010011000000110011001111010111011001110100010111011001001110010011101111001011011011110001101111010111001110110111010111011001011001110101111010110111110011010110001100110101110110010100010100010001110101110000011101111011110100110000001100110011110101110110011101000101110110010011100100111011110010110110111100011011110101110011101101110101110110010110011101011110101101101011110 e6b19aeca288eb83bde98199ebb3a2ec9c9de5b78deb9dbaecb3af5be6b19aeca288eb83bde98199ebb3a2ec9c9de5b78deb9dbaecb3af5b5e
UHC 汚좈냽遙볢윝巍띺쳯[汚좈냽遙볢윝巍띺쳯[^ 111001111111110110100000111010011000011010001101111010011010101110010011111010001001111110100000111010001110010010001101111010011010101110010011010110111110011111111101101000001110100110000110100011011110100110101011100100111110100010011111101000001110100011100100100011011110100110101011100100110101101101011110 e7fda0e9868de9ab93e89fa0e8e48de9ab935be7fda0e9868de9ab93e89fa0e8e48de9ab935b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)