To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 汚??厓??松у?[汚??厓??松у?[^ 1000100110011000001111110011111111111010100011010011111100111111100011111011110010000100100001010011111101011011100010011001100000111111001111111111101010001101001111110011111110001111101111001000010010000101001111110101101101011110 89983f3ffa8d3f3f8fbc84853f5b89983f3ffa8d3f3f8fbc84853f5b5e
EUC-JP 汚??厓??松у?[汚??厓??松у?[^ 10110001111110000011111100111111100011111011010011000111001111110011111110111110101111101010011111100101001111110101101110110001111110000011111100111111100011111011010011000111001111110011111110111110101111101010011111100101001111110101101101011110 b1f83f3f8fb4c73f3fbebea7e53f5bb1f83f3f8fb4c73f3fbebea7e53f5b5e
UTF-8 汚뉍닾厓녘웻松у뤂[汚뉍닾厓녘웻松у뤂[^ 11100110101100011001101011101011100010011000110111101011100010111011111011100101100011101001001111101011100001011001100011101100100110111011101111100110100111011011111011010001100000111110101110100100100000100101101111100110101100011001101011101011100010011000110111101011100010111011111011100101100011101001001111101011100001011001100011101100100110111011101111100110100111011011111011010001100000111110101110100100100000100101101101011110 e6b19aeb898deb8bbee58e93eb8598ec9bbbe69dbed183eba4825be6b19aeb898deb8bbee58e93eb8598ec9bbbe69dbed183eba4825b5e
UHC 汚뉍닾厓녘웻松у뤂[汚뉍닾厓녘웻松у뤂[^ 111001111111110110000111111000101000100010101100111001001110110110110011111010001001111110000111111000011110011010101100111001011000111110110011010110111110011111111101100001111110001010001000101011001110010011101101101100111110100010011111100001111110000111100110101011001110010110001111101100110101101101011110 e7fd87e288ace4edb3e89f87e1e6ace58fb35be7fd87e288ace4edb3e89f87e1e6ace58fb35b5e
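A table like the one above can be approximated with a few lines of Python. This is only a sketch, not the tool's actual implementation: the helper name `encode_table` and the `CHARSETS` mapping are made up for illustration, and it assumes that Python's `cp932` codec corresponds to SJIS-WIN and `cp949` to UHC. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of `3f` in the rows above.

```python
# Sketch: encode one string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.
# Assumption: the labels on the left map to these Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return a list of (charset label, bit string, hex string) tuples."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("汚[^"):
        print(label, bits, hex_string)
```

For the input "汚[^", the ISO-8859-1 row comes out as `3f5b5e` because "汚" has no Latin-1 representation, while UTF-8 needs three bytes for it (`e6b19a`) followed by `5b` and `5e`.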

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).