To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 汝??譽??隘??n}汝??譽??隘??n{^ 1001001111110000001111110011111111100110101000110011111100111111111010001010010100111111001111110110111001111101100100111111000000111111001111111110011010100011001111110011111111101000101001010011111100111111011011100111101101011110 93f03f3fe6a33f3fe8a53f3f6e7d93f03f3fe6a33f3fe8a53f3f6e7b5e
EUC-JP 汝??譽??隘??n}汝??譽??隘??n{^ 1100011011110010001111110011111111101100101001010011111100111111111100001010011100111111001111110110111001111101110001101111001000111111001111111110110010100101001111110011111111110000101001110011111100111111011011100111101101011110 c6f23f3feca53f3ff0a73f3f6e7dc6f23f3feca53f3ff0a73f3f6e7b5e
UTF-8 汝싩궘譽긷뜏隘녽걶n}汝싩궘譽긷뜏隘녽걶n{^ 1110011010110001100111011110110010001011101010011110101010110110100110001110100010101101101111011110101010111000101101111110101110011100100011111110100110011010100110001110101110000101101111011110101010110001101101100110111001111101111001101011000110011101111011001000101110101001111010101011011010011000111010001010110110111101111010101011100010110111111010111001110010001111111010011001101010011000111010111000010110111101111010101011000110110110011011100111101101011110 e6b19dec8ba9eab698e8adbdeab8b7eb9c8fe99a98eb85bdeab1b66e7de6b19dec8ba9eab698e8adbdeab8b7eb9c8fe99a98eb85bdeab1b66e7b5e
UHC 汝싩궘譽긷뜏隘녽걶n}汝싩궘譽긷뜏隘녽걶n{^ 1110011010100011100110101110011110000010101011011110011111100010101100011110010110001101100100101110010011110110100001101110100110000001100111000110111001111101111001101010001110011010111001111000001010101101111001111110001010110001111001011000110110010010111001001111011010000110111010011000000110011100011011100111101101011110 e6a39ae782ade7e2b1e58d92e4f686e9819c6e7de6a39ae782ade7e2b1e58d92e4f686e9819c6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)