To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 鼇??泣②?猿??}鼇??泣②?猿??{^ 1110101010000111001111110011111110001011100000111000011101000001001111111000100110001110001111110011111101111101111010101000011100111111001111111000101110000011100001110100000100111111100010011000111000111111001111110111101101011110 ea873f3f8b8387413f898e3f3f7dea873f3f8b8387413f898e3f3f7b5e
EUC-JP 鼇??泣??猿??}鼇??泣??猿??{^ 111100111110011100111111001111111011010111100011001111110011111110110001111011100011111100111111011111011111001111100111001111110011111110110101111000110011111100111111101100011110111000111111001111110111101101011110 f3e73f3fb5e33f3fb1ee3f3f7df3e73f3fb5e33f3fb1ee3f3f7b5e
UTF-8 鼇앸뵃泣②쯁猿낅츍}鼇앸뵃泣②쯁猿낅츍{^ 111010011011110010000111111011001001010110111000111010111011010110000011111001101011001110100011111000101001000110100001111011001010111110000001111001111000110010111111111010111000001010000101111011001011100010001101011111011110100110111100100001111110110010010101101110001110101110110101100000111110011010110011101000111110001010010001101000011110110010101111100000011110011110001100101111111110101110000010100001011110110010111000100011010111101101011110 e9bc87ec95b8ebb583e6b3a3e291a1ecaf81e78cbfeb8285ecb88d7de9bc87ec95b8ebb583e6b3a3e291a1ecaf81e78cbfeb8285ecb88d7b5e
UHC 鼇앸뵃泣②쯁猿낅츍}鼇앸뵃泣②쯁猿낅츍{^ 111010001010100010011101111010111001010010001001111010111110100010101000111010001010100010011101111010101011101110000101111010111010111010001000011111011110100010101000100111011110101110010100100010011110101111101000101010001110100010101000100111011110101010111011100001011110101110101110100010000111101101011110 e8a89deb9489ebe8a8e8a89deabb85ebae887de8a89deb9489ebe8a8e8a89deabb85ebae887b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
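
The conversion in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The codec names are Python's own, and mapping the page's labels to them (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption, as is the helper name `encode_report`. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the rows above.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {label: (binary string, hex string)} for each charset."""
    report = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        report[label] = (binary, raw.hex())
    return report


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_report("{^").items():
        print(label, binary, hexadecimal)
```

Because `{` and `^` are ASCII, every charset listed here encodes them as the same two bytes, `7b5e`, which is why each row of the table ends with that value.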