To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 硝踴章イv硝踴章イvB 1000111111001001111001101111101110001111110011011011001001110110100011111100100111100110111110111000111111001101101100100111011001000010 8fc9e6fb8fcdb2768fc9e6fb8fcdb27642
EUC-JP 硝踴章イv硝踴章イvB 10111110110010111110110011111101101111101100111110001110101100100111011010111110110010111110110011111101101111101100111110001110101100100111011001000010 becbecfdbecf8eb276becbecfdbecf8eb27642
UTF-8 硝踴章イv硝踴章イvB 111001111010000110011101111010001011100010110100111001111010101110100000111011111011110110110010011101101110011110100001100111011110100010111000101101001110011110101011101000001110111110111101101100100111011001000010 e7a19de8b8b4e7aba0efbdb276e7a19de8b8b4e7aba0efbdb27642
UHC 硝?章?v硝?章?vB 111101011010011000111111111011011111000100111111011101101111010110100110001111111110110111110001001111110111011001000010 f5a63fedf13f76f5a63fedf13f7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)