To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN ???詭v???詭vB 00111111001111110011111111100110011010110111011000111111001111110011111111100110011010110111011001000010 3f3f3fe66b763f3f3fe66b7642
EUC-JP ???詭v???詭vB 00111111001111110011111111101011110011000111011000111111001111110011111111101011110011000111011001000010 3f3f3febcc763f3f3febcc7642
UTF-8 黎앸강詭v黎앸강詭vB 111011111010011010001001111011001001010110111000111010101011000010010101111010001010100110101101011101101110111110100110100010011110110010010101101110001110101010110000100101011110100010101001101011010111011001000010 efa689ec95b8eab095e8a9ad76efa689ec95b8eab095e8a9ad7642
UHC 黎앸강詭v黎앸강詭vB 11100110101100011001110111101011101100001010110111001111111110000111011011100110101100011001110111101011101100001010110111001111111110000111011001000010 e6b19debb0adcff876e6b19debb0adcff87642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)