What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 縡?怨峯?弔?醍??[縡?怨峯?弔?醍??[^ 111000110111000100111111100010011000010110010101111101010011111110010010101000100011111110010001111001110011111100111111010110111110001101110001001111111000100110000101100101011111010100111111100100101010001000111111100100011110011100111111001111110101101101011110 e3713f898595f53f92a23f91e73f3f5be3713f898595f53f92a23f91e73f3f5b5e
EUC-JP 縡?怨峯?弔?醍??[縡?怨峯?弔?醍??[^ 111001011101001000111111101100011110010111001010111101110011111111000100101001000011111111000010111010010011111100111111010110111110010111010010001111111011000111100101110010101111011100111111110001001010010000111111110000101110100100111111001111110101101101011110 e5d23fb1e5caf73fc4a43fc2e93f3f5be5d23fb1e5caf73fc4a43fc2e93f3f5b5e
UTF-8 縡렕怨峯긺弔렲醍당긺[縡렕怨峯긺弔렲醍당긺[^ 111001111011100010100001111010111010000010010101111001101000000010101000111001011011001110101111111010101011100010111010111001011011110010010100111010111010000010110010111010011000011010001101111010111000101110111001111010101011100010111010010110111110011110111000101000011110101110100000100101011110011010000000101010001110010110110011101011111110101010111000101110101110010110111100100101001110101110100000101100101110100110000110100011011110101110001011101110011110101010111000101110100101101101011110 e7b8a1eba095e680a8e5b3afeab8bae5bc94eba0b2e9868deb8bb9eab8ba5be7b8a1eba095e680a8e5b3afeab8bae5bc94eba0b2e9868deb8bb9eab8ba5b5e
UHC 縡렕怨峯긺弔렲醍당긺[縡렕怨峯긺弔렲醍당긺[^ 11101110101011011000111010101010111010101011001111011100111001111011000111100111111100001100000010001110101111111111000010110101101101001110011110110001111001110101101111101110101011011000111010101010111010101011001111011100111001111011000111100111111100001100000010001110101111111111000010110101101101001110011110110001111001110101101101011110 eead8eaaeab3dce7b1e7f0c08ebff0b5b4e7b1e75beead8eaaeab3dce7b1e7f0c08ebff0b5b4e7b1e75b5e
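The conversion in the table above can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The function name `to_bit_strings` and the codec mapping are assumptions made for illustration: Python's `cp932`, `euc_jp`, and `cp949` codecs stand in for SJIS-WIN, EUC-JP, and UHC, and their output may differ from the page's converter for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the ISO-8859-1 row.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "縡[^"
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.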

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).