What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????Lh??????L 001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f4c683f3f3f3f3f3f4c
SJIS-WIN 蕭h音鐔緒週Lh蕭h音鐔緒週L 111001010100101010000010100010001000100110111001111010000101110010001111100011111000111101010100010011000110100011100101010010101000001010001000100010011011100111101000010111001000111110001111100011110101010001001100 e54a828889b9e85c8f8f8f544c68e54a828889b9e85c8f8f8f544c
EUC-JP 蕭h音鐔緒週Lh蕭h音鐔緒週L 111010011010101110100011111010001011001010111011111011111011110110111101111011111011110110110101010011000110100011101001101010111010001111101000101100101011101111101111101111011011110111101111101111011011010101001100 e9aba3e8b2bbefbdbdefbdb54c68e9aba3e8b2bbefbdbdefbdb54c
UTF-8 蕭h音鐔緒週Lh蕭h音鐔緒週L 111010001001010110101101111011111011110110001000111010011001111110110011111010011001000010010100111001111011011110010010111010011000000010110001010011000110100011101000100101011010110111101111101111011000100011101001100111111011001111101001100100001001010011100111101101111001001011101001100000001011000101001100 e895adefbd88e99fb3e99094e7b792e980b14c68e895adefbd88e99fb3e99094e7b792e980b14c
UHC 蕭h音??週Lh蕭h音??週L 1110000111001011101000111110100011101011111001010011111100111111111100011100111001001100011010001110000111001011101000111110100011101011111001010011111100111111111100011100111001001100 e1cba3e8ebe53f3ff1ce4c68e1cba3e8ebe53f3ff1ce4c

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
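
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the "?" entries in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("音L").items():
        print(label, binary, hexadecimal)
```

An ASCII letter such as "L" encodes to the single byte 4c in every charset listed, while a character such as 音 gives a different byte sequence in each one (e99fb3 in UTF-8, 89b9 in CP932).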