To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 誤???沃??椅??恂ル?阿??援θぜ 10001100111010110011111100111111001111111001011110000000001111110011111110001000110101100011111100111111100111001001011010000011100010110011111110001000101000100011111100111111100010011000011110000011110001101000001010111010 8ceb3f3f3f97803f3f88d63f3f9c96838b3f88a23f3f898783c682ba
EUC-JP 誤???沃??椅??恂ル?阿??援θぜ 10111000111011010011111100111111001111111100110111100000001111110011111110110000110110000011111100111111110101111111011010100101111010110011111110110000101001000011111100111111101100011110011110100110110010001010010010111100 b8ed3f3f3fcde03f3fb0d83f3fd7f6a5eb3fb0a43f3fb1e7a6c8a4bc
UTF-8 誤곸룆큔沃쇈깺椅먩윍恂ル츘阿잕풝援θぜ 1110100010101010101001001110101010110011101110001110101110100011100001101110110110000001100101001110011010110010100000111110110010000111100010001110101010111001101110101110011010100100100001011110101110101000101010011110110010011100100011011110011010000001100000101110001110000011101010111110110010111000100110001110100110011000101111111110110010011110100101011110110110010010100111011110011010001111101101001100111010111000111000111000000110011100 e8aaa4eab3b8eba386ed8194e6b283ec8788eab9bae6a485eba8a9ec9c8de68182e383abecb898e998bfec9e95ed929de68fb4ceb8e3819c
UHC 誤곸룆큔沃쇈깺椅먩윍恂ル츘阿잕풝援θぜ 1110100010100110100000011110110010001111100001011100010110100110111010001010101010111100111000111000001110100110111010111111010110010000111001101001111110010100111000101110000110101011111010111010111010010010111001001011100110011111111010101011111010100000111010101011010110100101111010001010101010111100 e8a681ec8f85c5a6e8aabce383a6ebf590e69f94e2e1abebae92e4b99feabea0eab5a5e8aabc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)