To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 猥??蘊ょ?歪ゆ?n}猥??蘊ょ?歪ゆ?n{^ 111000001100111000111111001111111110010101011101100000101110010100111111100110000110001110000010111001000011111101101110011111011110000011001110001111110011111111100101010111011000001011100101001111111001100001100011100000101110010000111111011011100111101101011110 e0ce3f3fe55d82e53f986382e43f6e7de0ce3f3fe55d82e53f986382e43f6e7b5e
EUC-JP 猥??蘊ょ?歪ゆ?n}猥??蘊ょ?歪ゆ?n{^ 111000001101000000111111001111111110100110111110101001001110011100111111110011111100010010100100111001100011111101101110011111011110000011010000001111110011111111101001101111101010010011100111001111111100111111000100101001001110011000111111011011100111101101011110 e0d03f3fe9bea4e73fcfc4a4e63f6e7de0d03f3fe9bea4e73fcfc4a4e63f6e7b5e
UTF-8 猥덅븡蘊ょ겮歪ゆ뼔n}猥덅븡蘊ょ겮歪ゆ뼔n{^ 1110011110001100101001011110101110001101100001011110101110111000101000011110100010011000100010101110001110000010100001111110101010110010101011101110011010101101101010101110001110000010100001101110101110111100100101000110111001111101111001111000110010100101111010111000110110000101111010111011100010100001111010001001100010001010111000111000001010000111111010101011001010101110111001101010110110101010111000111000001010000110111010111011110010010100011011100111101101011110 e78ca5eb8d85ebb8a1e8988ae38287eab2aee6adaae38286ebbc946e7de78ca5eb8d85ebb8a1e8988ae38287eab2aee6adaae38286ebbc946e7b5e
UHC 猥덅븡蘊ょ겮歪ゆ뼔n}猥덅븡蘊ょ겮歪ゆ뼔n{^ 1110100011100101100010001110100010010101100010101110100010110011101010101110011110000001101111001110100011100000101010101110011010010110100111000110111001111101111010001110010110001000111010001001010110001010111010001011001110101010111001111000000110111100111010001110000010101010111001101001011010011100011011100111101101011110 e8e588e8958ae8b3aae781bce8e0aae6969c6e7de8e588e8958ae8b3aae781bce8e0aae6969c6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)