To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???韋??膺??孃る?茵?┃怨??阿?? 0011111100111111001111111110100011101000001111110011111111100100010111100011111100111111100110110110111110000010111010010011111111100100100111110011111110000100101010111000100110000101001111110011111110001000101000100011111100111111 3f3f3fe8e83f3fe45e3f3f9b6f82e93fe49f3f84ab89853f3f88a23f3f
EUC-JP ???韋??膺??孃る?茵?┃怨??阿?? 0011111100111111001111111111000011101010001111110011111111100111101111110011111100111111110101011101000010100100111010110011111111101000101000010011111110101000101011011011000111100101001111110011111110110000101001000011111100111111 3f3f3ff0ea3f3fe7bf3f3fd5d0a4eb3fe8a13fa8adb1e53f3fb0a43f3f
UTF-8 嶺뚣뢿韋뗤튃膺욧콞孃る슡茵낂┃怨몄돶阿숈쾿 111011111010011010101011111010111001101010100011111010111010001010111111111010011001111110001011111010111001011110100100111011011000101010000011111010001000011010111010111011001001101010100111111011001011110110011110111001011010110110000011111000111000001010001011111011001000101010100001111010001000110010110101111010111000001010000010111000101001010010000011111001101000000010101000111010111010101010000100111010111000111110110110111010011001100010111111111011001000100010001000111011001011111010111111 efa6abeb9aa3eba2bfe99f8beb97a4ed8a83e886baec9aa7ecbd9ee5ad83e3828bec8aa1e88cb5eb8282e29483e680a8ebaa84eb8fb6e998bfec8888ecbebf
UHC 嶺뚣뢿韋뗤튃膺욧콞孃る슡茵낂┃怨몄돶阿숈쾿 111001111010110110001100111000111000111110000010111010101101111110001011111001001011100110011001111010111110110010111111111010101011000110010110111001011011111010101010111010111001101010101101111011001110000010000101111010011010011010101101111010101011001110111000111011001000100110111001111001001011100110011001111011001011001010010101 e7ad8ce38f82eadf8be4b999ebecbfeab196e5beaaeb9aadece085e9a6adeab3b8ec89b9e4b999ecb295

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)