To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????擁??霓 001111110011111100111111001111110011111100111111100101110110100100111111001111111110100010111101 3f3f3f3f3f3f97693f3fe8bd
EUC-JP ??????擁??霓 001111110011111100111111001111110011111100111111110011011100101000111111001111111111000010111111 3f3f3f3f3f3fcdca3f3ff0bf
UTF-8 遼깁옩溜뺣젙擁얜젔霓 111011111010011110000011111010101011100110000001111011001001100010101001111011111010011110001011111010111011101010100011111011001010000010011001111001101001001110000001111011001001011010011100111011001010000010010100111010011001110010010011 efa783eab981ec98a9efa78bebbaa3eca099e69381ec969ceca094e99c93
UHC 遼깁옩溜뺣젙擁얜젔霓 1110100110101100101100011110100110011110101010001110101011111110100101011110101110100000100101011110100010110110101111101110101110100000100100101110011111100111 e9acb1e99ea8eafe95eba095e8b6beeba092e7e7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)