To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????N}?????????N{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111001111101001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 嗚??辱ヨ?狎??N}嗚??辱ヨ?狎??N{^ 10011010011010100011111100111111100100000100101010000011100010000011111111100000101111100011111100111111010011100111110110011010011010100011111100111111100100000100101010000011100010000011111111100000101111100011111100111111010011100111101101011110 9a6a3f3f904a83883fe0be3f3f4e7d9a6a3f3f904a83883fe0be3f3f4e7b5e
EUC-JP 嗚??辱ヨ?狎??N}嗚??辱ヨ?狎??N{^ 11010011110010110011111100111111101111111010101110100101111010000011111111100000110000000011111100111111010011100111110111010011110010110011111100111111101111111010101110100101111010000011111111100000110000000011111100111111010011100111101101011110 d3cb3f3fbfaba5e83fe0c03f3f4e7dd3cb3f3fbfaba5e83fe0c03f3f4e7b5e
UTF-8 嗚잌넍辱ヨ뮓狎쇿츒N}嗚잌넍辱ヨ뮓狎쇿츒N{^ 1110010110010111100110101110110010011110100011001110101110000100100011011110100010111110101100011110001110000011101010001110101110101110100100111110011110001011100011101110110010000111101111111110110010111000100100100100111001111101111001011001011110011010111011001001111010001100111010111000010010001101111010001011111010110001111000111000001110101000111010111010111010010011111001111000101110001110111011001000011110111111111011001011100010010010010011100111101101011110 e5979aec9e8ceb848de8beb1e383a8ebae93e78b8eec87bfecb8924e7de5979aec9e8ceb848de8beb1e383a8ebae93e78b8eec87bfecb8924e7b5e
UHC 嗚잌넍辱ヨ뮓狎쇿츒N}嗚잌넍辱ヨ뮓狎쇿츒N{^ 1110011111110000100111111110010110000110100110011110100110110100101010111110100010010010100111111110010011100100100110011110010110101110100011010100111001111101111001111111000010011111111001011000011010011001111010011011010010101011111010001001001010011111111001001110010010011001111001011010111010001101010011100111101101011110 e7f09fe58699e9b4abe8929fe4e499e5ae8d4e7de7f09fe58699e9b4abe8929fe4e499e5ae8d4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)