What bit string is a character (or short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???孺??惟れ? 001111110011111100111111100110110111110100111111001111111000100011010010100000101110101000111111 3f3f3f9b7d3f3f88d282ea3f
EUC-JP 濚??孺??惟れ? 1000111111001001101000010011111100111111110101011101111000111111001111111011000011010100101001001110110000111111 8fc9a13f3fd5de3f3fb0d4a4ec3f
UTF-8 濚밤꺃孺쇽쭓惟れ쉔 111001101011111110011010111010111011000010100100111010101011101010000011111001011010110110111010111011001000011110111101111011001010110110010011111001101000001110011111111000111000001010001100111011001000100110010100 e6bf9aebb0a4eaba83e5adbaec87bdecad93e6839fe3828cec8994
UHC 濚밤꺃孺쇽쭓惟れ쉔 111001111011100110111001111000111000001110101100111010101110100010111100111011111010011110001011111010101110111010101010111011001011110110101000 e7b9b9e383aceae8bcefa78beaeeaaecbda8

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
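
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the charset names above to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) is an assumption, and `encode_to_bits` is a hypothetical helper name. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how the runs of 3f in the table would arise.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) because of errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero padded
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "れ"  # hiragana RE, one of the characters in the table
    for name, codec in CHARSETS.items():
        bits, hex_string = encode_to_bits(sample, codec)
        print(name, bits, hex_string)
```

For the character れ, this sketch yields e3828c in UTF-8, 82ea in CP932, and a4ec in EUC-JP, which agree with the corresponding byte sequences in the table rows above. ISO-8859-1 has no such character, so it yields 3f.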