To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??楢??攸??檍??淫?┸???壤?? 11101001111100100011111100111111100100111110100000111111001111111001110110111111001111110011111110011110111110000011111100111111100010001111101000111111100001001011110100111111001111110011111110011010110111110011111100111111 e9f23f3f93e83f3f9dbf3f3f9ef83f3f88fa3f84bd3f3f3f9adf3f3f
EUC-JP 鶯??楢??攸??檍??淫?┸洧??壤?? 111100101111010000111111001111111100011011101010001111110011111111011010110000010011111100111111110111001111101000111111001111111011000011111100001111111010100010111111100011111100011110110100001111110011111111010100111000010011111100111111 f2f43f3fc6ea3f3fdac13f3fdcfa3f3fb0fc3fa8bf8fc7b43f3fd4e13f3f
UTF-8 鶯볤쑬楢욘뿥攸곷뎨檍됰㈇淫먲┸洧얜쿋壤쏅쥒 111010011011011010101111111010111011001110100100111011001001000110101100111001101010010110100010111011001001101010011000111010111011111110100101111001101001010010111000111010101011001110110111111010111000111010101000111001101010101010001101111010111001000010110000111000111000100010000111111001101011011110101011111010111010100010110010111000101001010010111000111001101011010010100111111011001001011010011100111011001011111110001011111001011010001110100100111011001000111110000101111011001010010110010010 e9b6afebb3a4ec91ace6a5a2ec9a98ebbfa5e694b8eab3b7eb8ea8e6aa8deb90b0e38887e6b7abeba8b2e294b8e6b4a7ec969cecbf8be5a3a4ec8f85eca592
UHC 鶯볤쑬楢욘뿥攸곷뎨檍됰㈇淫먲┸洧얜쿋壤쏅쥒 111001011010001110010011111010101011111010101000111010101111100110111111111001101001011110100101111010101111001010000001111010111011010110110011111001011110010110001001111010111010100110111000111010111110001010010000111011111010011010111111111010101111101110111110111010111011001010100000111001011011110110011011111010111010001010001001 e5a393eabea8eaf9bfe697a5eaf281ebb5b3e5e589eba9b8ebe290efa6bfeafbbeebb2a0e5bd9beba289

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)