What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 猥?????楡?????猥?????楡?????B 1110000011001110001111110011111100111111001111110011111110011110101111100011111100111111001111110011111100111111111000001100111000111111001111110011111100111111001111111001111010111110001111110011111100111111001111110011111101000010 e0ce3f3f3f3f3f9ebe3f3f3f3f3fe0ce3f3f3f3f3f9ebe3f3f3f3f3f42
EUC-JP 猥??堉??楡?????猥??堉??楡?????B 111000001101000000111111001111111000111110110111111111010011111100111111110111001100000000111111001111110011111100111111001111111110000011010000001111110011111110001111101101111111110100111111001111111101110011000000001111110011111100111111001111110011111101000010 e0d03f3f8fb7fd3f3fdcc03f3f3f3f3fe0d03f3f8fb7fd3f3fdcc03f3f3f3f3f42
UTF-8 猥롪퍔堉뤸레楡년꼻嶺띾샄猥롪퍔堉뤸레楡년꼻嶺띾샄B 11100111100011001010010111101011101000011010101011101101100011011001010011100101101000001000100111101011101001001011100011101011101000001000100011100110101001011010000111101011100001011000010011101010101111001011101111101111101001101010101111101011100111011011111011101100100000111000010011100111100011001010010111101011101000011010101011101101100011011001010011100101101000001000100111101011101001001011100011101011101000001000100011100110101001011010000111101011100001011000010011101010101111001011101111101111101001101010101111101011100111011011111011101100100000111000010001000010 e78ca5eba1aaed8d94e5a089eba4b8eba088e6a5a1eb8584eabcbbefa6abeb9dbeec8384e78ca5eba1aaed8d94e5a089eba4b8eba088e6a5a1eb8584eabcbbefa6abeb9dbeec838442
UHC 猥롪퍔堉뤸레楡년꼻嶺띾샄猥롪퍔堉뤸레楡년꼻嶺띾샄B 11101000111001011000111011101010101110111000101111101011101111001000111111100110101101111011100111101010111110001011001111100010100001001001001111100111101011011000110111101011100110001011011011101000111001011000111011101010101110111000101111101011101111001000111111100110101101111011100111101010111110001011001111100010100001001001001111100111101011011000110111101011100110001011011001000010 e8e58eeabb8bebbc8fe6b7b9eaf8b3e28493e7ad8deb98b6e8e58eeabb8bebbc8fe6b7b9eaf8b3e28493e7ad8deb98b642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
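
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name to_bit_strings is hypothetical, and the Python codec names (cp932 for SJIS-Win, euc_jp for EUC-JP, cp949 for UHC) are assumed to correspond to the charset labels used above. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does as well.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset and return (binary, hexadecimal) strings.

    Characters the charset cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return binary, data.hex()


# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings("B", codec)
    print(label, "B", binary, hexadecimal)
```

For the ASCII letter "B", every charset listed yields the same single byte (01000010, hexadecimal 42), which matches the final byte of each row in the table.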