To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??儒??窈??鍮??節??阿??? 00111111001111110011111110001011100000110011111100111111100011101111001000111111001111111110001001110111001111110011111111101000010010100011111100111111100100001101111100111111001111111000100010100010001111110011111100111111 3f3f3f8b833f3f8ef23f3fe2773f3fe84a3f3f90df3f3f88a23f3f3f
EUC-JP ???泣??儒??窈??鍮??節??阿??? 00111111001111110011111110110101111000110011111100111111101111001111010000111111001111111110001111011000001111110011111111101111101010110011111100111111110000001110000100111111001111111011000010100100001111110011111100111111 3f3f3fb5e33f3fbcf43f3fe3d83f3fefab3f3fc0e13f3fb0a43f3f3f
UTF-8 捻꿔끆泣앮뵺儒우삅窈띾봿鍮뽩츦節됱녃阿쇡삳룑 111011111010011010100100111010101011111110010100111010111000000110000110111001101011001110100011111011001001010110101110111010111011010110111010111001011000010010010010111011001001101010110000111011001000001010000101111001111010101010001000111010111001110110111110111010111011010010111111111010011000110110101110111010111011110110101001111011001011100010100110111001111010111110000000111010111001000010110001111010111000010110000011111010011001100010111111111011001000011110100001111011001000001010110011111010111010001110010001 efa6a4eabf94eb8186e6b3a3ec95aeebb5bae58492ec9ab0ec8285e7aa88eb9dbeebb4bfe98daeebbda9ecb8a6e7af80eb90b1eb8583e998bfec87a1ec82b3eba391
UHC 捻꿔끆泣앮뵺儒우삅窈띾봿鍮뽩츦節됱녃阿쇡삳룑 1110011011110111101100101110001110000101101110101110101111101000100111011110011010010100101110001110101011100011101111111110110010011000100011001110100110100001100011011110101110010100100001101110101110111001100101101110010110101110100111001110111110111101100010011110110010000110101110111110010010111001100110011100111010111011111010111000111110001110 e6f7b2e385baebe89de694b8eae3bfec988ce9a18deb9486ebb996e5ae9cefbd89ec86bbe4b999cebbeb8f8e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
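
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions, and Python's codecs may differ from this tool's converter in edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which is one plausible reason for the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text: str, codec: str) -> tuple:
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Each byte becomes 8 binary digits, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("泣", codec)
        print(f"{label:<10} {binary} {hexadecimal}")
```

For the single character 泣, this sketch gives 3f for ISO-8859-1, 8b83 for SJIS-WIN, b5e3 for EUC-JP, and e6b3a3 for UTF-8, which agrees with the byte sequences listed for that character in the table.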