To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 猥???畏??誼?n}猥???畏??誼?n{^ 1110000011001110001111110011111100111111100010001101100000111111001111111000101101100010001111110110111001111101111000001100111000111111001111110011111110001000110110000011111100111111100010110110001000111111011011100111101101011110 e0ce3f3f3f88d83f3f8b623f6e7de0ce3f3f3f88d83f3f8b623f6e7b5e
EUC-JP 猥???畏??誼?n}猥???畏??誼?n{^ 1110000011010000001111110011111100111111101100001101101000111111001111111011010111000011001111110110111001111101111000001101000000111111001111110011111110110000110110100011111100111111101101011100001100111111011011100111101101011110 e0d03f3f3fb0da3f3fb5c33f6e7de0d03f3f3fb0da3f3fb5c33f6e7b5e
UTF-8 猥롈살젴畏브퀡誼첦n}猥롈살젴畏브퀡誼첦n{^ 1110011110001100101001011110101110100001100010001110110010000010101101001110110010100000101101001110011110010101100011111110101110111000100011001110110110000000101000011110100010101010101111001110110010110010101001100110111001111101111001111000110010100101111010111010000110001000111011001000001010110100111011001010000010110100111001111001010110001111111010111011100010001100111011011000000010100001111010001010101010111100111011001011001010100110011011100111101101011110 e78ca5eba188ec82b4eca0b4e7958febb88ced80a1e8aabcecb2a66e7de78ca5eba188ec82b4eca0b4e7958febb88ced80a1e8aabcecb2a66e7b5e
UHC 猥롈살젴畏브퀡誼첦n}猥롈살젴畏브퀡誼첦n{^ 1110100011100101100011101100111010111011111011001010000010101000111010001110011010111010111010101011001110010101111010111111111010101011010011110110111001111101111010001110010110001110110011101011101111101100101000001010100011101000111001101011101011101010101100111001010111101011111111101010101101001111011011100111101101011110 e8e58ecebbeca0a8e8e6baeab395ebfeab4f6e7de8e58ecebbeca0a8e8e6baeab395ebfeab4f6e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
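
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as are the helper names `encode_row` and `convert`. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the table, although individual mappings may differ between implementations.

```python
# Table labels mapped to Python codec names (an assumed mapping,
# not taken from the tool itself).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with codec and return (binary string, hex string).

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("猥n}").items():
        print(label, bits, hexstr)
```

Using `errors="replace"` rather than the default strict mode keeps the output the same length in characters as the input, so unrepresentable characters stay visible as `?` instead of raising an exception.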