What bit string is a character (or several characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣▽?蹂k?孃る?乳??矣????? 0011111100111111001111111000101110000011100000011010010000111111111001101111100010000010100010110011111110011011011011111000001011101001001111111001001111111011001111110011111111100001111000010011111100111111001111110011111100111111 3f3f3f8b8381a43fe6f8828b3f9b6f82e93f93fb3f3fe1e13f3f3f3f3f
EUC-JP ???泣▽?蹂k?孃る?乳??矣????? 0011111100111111001111111011010111100011101000101010011000111111111011001111101010100011111010110011111111010101110100001010010011101011001111111100011011111101001111110011111111100010111000110011111100111111001111110011111100111111 3f3f3fb5e3a2a63fecfaa3eb3fd5d0a4eb3fc6fd3f3fe2e33f3f3f3f3f
UTF-8 捻꿔끇泣▽슭蹂k눀孃る돆乳몌쬅矣⑸늅說깅뜥 111011111010011010100100111010101011111110010100111010111000000110000111111001101011001110100011111000101001011010111101111011001000101010101101111010001011100110000010111011111011110110001011111010111000100010000000111001011010110110000011111000111000001010001011111010111000111110000110111001001011100110110011111010111010101010001100111011001010110010000101111001111001111110100011111000101001000110111000111010111000101010000101111011111010011010100001111010101011100110000101111010111001110010100101 efa6a4eabf94eb8187e6b3a3e296bdec8aade8b982efbd8beb8880e5ad83e3828beb8f86e4b9b3ebaa8cecac85e79fa3e291b8eb8a85efa6a1eab985eb9ca5
UHC 捻꿔끇泣▽슭蹂k눀孃る돆乳몌쬅矣⑸늅說깅뜥 111001101111011110110010111000111000010110111011111010111110100010100001111001001011110110111110111010111011001110100011111010111000011110100001111001011011111010101010111010111000100110010111111010101110000110111000111011111010011010011100111010111111100010101001111010111011010010111110111001101111001010110001111010111000110110101000 e6f7b2e385bbebe8a1e4bdbeebb3a3eb87a1e5beaaeb8997eae1b8efa69cebf8a9ebb4bee6f2b1eb8da8

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
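
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `encode_row` and `convert` are hypothetical, and the mapping from the charset labels to Python codec names (in particular `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_row(text, codec):
    """Return (binary, hexadecimal) bit strings for text in one codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Build one (binary, hexadecimal) pair per charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

With this sketch, `convert("あ")["UTF-8"]` yields the hexadecimal string `e38182`, while the ISO-8859-1 entry is `3f` because that charset has no hiragana.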