To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???誼??袁る?筌??誼??臾?????移 0011111100111111001111111000101101100010001111110011111111100101110011011000001011101001001111111110001010100011001111110011111110001011011000100011111100111111111001000110101100111111001111110011111100111111001111111000100011011010 3f3f3f8b623f3fe5cd82e93fe2a33f3f8b623f3fe46b3f3f3f3f3f88da
EUC-JP ???誼??袁る?筌??誼??臾?????移 0011111100111111001111111011010111000011001111110011111111101010110011111010010011101011001111111110010010100101001111110011111110110101110000110011111100111111111001111100110000111111001111110011111100111111001111111011000011011100 3f3f3fb5c33f3feacfa4eb3fe4a53f3fb5c33f3fe7cc3f3f3f3f3fb0dc
UTF-8 劣꾨챶誼썼짅袁る즷筌뗪퀡誼뚥틪臾먯졄嶺뚮돃移 111011111010011010011101111010101011111010101000111011001011000110110110111010001010101010111100111011001000110110111100111011001010011110000101111010001010001010000001111000111000001010001011111011001010011010110111111001111010110110001100111010111001011110101010111011011000000010100001111010001010101010111100111010111001101010100101111011011000101110101010111010001000011110111110111010111010100010101111111011001010000110000100111011111010011010101011111010111001101010101110111010111000111110000011111001111010011110111011 efa69deabea8ecb1b6e8aabcec8dbceca785e8a281e3828beca6b7e7ad8ceb97aaed80a1e8aabceb9aa5ed8baae887beeba8afeca184efa6abeb9aaeeb8f83e7a7bb
UHC 劣꾨챶誼썼짅袁る즷筌뗪퀡誼뚥틪臾먯졄嶺뚮돃移 1110011011101011100001001110101110101010100000111110101111111110101111011110100010100011100101001110101010111110101010101110101110100011100010011110111110100111100010111110101010110011100101011110101111111110100011001110010010111010100101001110101110101100100100001110110010100000101101011110011110101101100011001110101110001001100101101110110010111001 e6eb84ebaa83ebfebde8a394eabeaaeba389efa78beab395ebfe8ce4ba94ebac90eca0b5e7ad8ceb8996ecb9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)