To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 上瘤ス酌v上瘤ス酌vB 1000111111100011111000011000111010111101100011101101111001110110100011111110001111100001100011101011110110001110110111100111011001000010 8fe3e18ebd8ede768fe3e18ebd8ede7642
EUC-JP 上瘤ス酌v上瘤ス酌vB 10111110111001011110000111101110100011101011110110111100111000000111011010111110111001011110000111101110100011101011110110111100111000000111011001000010 bee5e1ee8ebdbce076bee5e1ee8ebdbce07642
UTF-8 上瘤ス酌v上瘤ス酌vB 111001001011100010001010111001111001100010100100111011111011110110111101111010011000010110001100011101101110010010111000100010101110011110011000101001001110111110111101101111011110100110000101100011000111011001000010 e4b88ae798a4efbdbde9858c76e4b88ae798a4efbdbde9858c7642
UHC 上瘤?酌v上瘤?酌vB 1101111110111110110101111011101100111111111011011100110001110110110111111011111011010111101110110011111111101101110011000111011001000010 dfbed7bb3fedcc76dfbed7bb3fedcc7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)