To what bit string is a character (or short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鬱氓??繭蟻??坤鬱氓??繭蟻??鵠^ 1001111101010100100111111000001000111111001111111001011010011010100010110110000100111111001111111000110110100011100111110101010010011111100000100011111100111111100101101001101010001011011000010011111100111111100011011001010001011110 9f549f823f3f969a8b613f3f8da39f549f823f3f969a8b613f3f8d945e
EUC-JP 鬱氓??繭蟻??坤鬱氓??繭蟻??鵠^ 1101110110110101110111011110001000111111001111111100101111111010101101011100001000111111001111111011101010100101110111011011010111011101111000100011111100111111110010111111101010110101110000100011111100111111101110011111010001011110 ddb5dde23f3fcbfab5c23f3fbaa5ddb5dde23f3fcbfab5c23f3fb9f45e
UTF-8 鬱氓렮댄繭蟻뀌렫坤鬱氓렮댄繭蟻뀌렫鵠^ 11101001101011001011000111100110101100001001001111101011101000001010111011101011100011001000010011100111101110011010110111101000100111111011101111101011100000001000110011101011101000001010101111100101100111011010010011101001101011001011000111100110101100001001001111101011101000001010111011101011100011001000010011100111101110011010110111101000100111111011101111101011100000001000110011101011101000001010101111101001101101011010000001011110 e9acb1e6b093eba0aeeb8c84e7b9ade89fbbeb808ceba0abe59da4e9acb1e6b093eba0aeeb8c84e7b9ade89fbbeb808ceba0abe9b5a05e
UHC 鬱氓렮댄繭蟻뀌렫坤鬱氓렮댄繭蟻뀌렫鵠^ 11101010101001101101100011101100100011101011101110110100111011011100110010110110111010111111110010110010111011101000111010111001110011011101111011101010101001101101100011101100100011101011101110110100111011011100110010110110111010111111110010110010111011101000111010111001110011011101110001011110 eaa6d8ec8ebbb4edccb6ebfcb2ee8eb9cddeeaa6d8ec8ebbb4edccb6ebfcb2ee8eb9cddc5e
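
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's actual implementation. The function name `encode_to_bits` is hypothetical, and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the runs of 3f visible in the ISO-8859-1 row.

```python
def encode_to_bits(text, charset):
    """Return (binary string, hex string) for text encoded in charset.

    Hypothetical helper for illustration only.
    """
    # errors="replace" substitutes "?" (0x3f) for unencodable characters
    data = text.encode(charset, errors="replace")
    # Render every byte as 8 binary digits, most significant bit first
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed mapping from the table's charset labels to Python codec names
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    binary, hexadecimal = encode_to_bits("鬱^", codec)
    print(label, binary, hexadecimal)
```

Each byte becomes two hexadecimal digits and eight binary digits, so the binary column is always four times as long as the hexadecimal column.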

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).