To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????淫?????弛????????肉? 0011111100111111001111110011111100111111001111111000100011111010001111110011111100111111001111110011111110010010011011110011111100111111001111110011111100111111001111110011111100111111100100111111011100111111 3f3f3f3f3f3f88fa3f3f3f3f3f926f3f3f3f3f3f3f3f3f93f73f
EUC-JP ??????淫?????弛????????肉? 0011111100111111001111110011111100111111001111111011000011111100001111110011111100111111001111110011111111000011110100000011111100111111001111110011111100111111001111110011111100111111110001101111100100111111 3f3f3f3f3f3fb0fc3f3f3f3f3fc3d03f3f3f3f3f3f3f3fc6f93f
UTF-8 捻뀁룄鱗들렟淫딅룆濾낆옚弛꾣틦流곷츏麗몃씈肉쒫 111011111010011010100100111010111000000010000001111010111010001110000100111011111010011110110010111010111001001110100100111010111010000010011111111001101011011110101011111010111001010010000101111010111010001110000110111011111010011010000100111010111000001010000110111011001001100010011010111001011011110010011011111010101011111010100011111011011000101110100110111011111010011110001010111010101011001110110111111011001011100010001111111011111010011010001000111010111010101010000011111011001001010010001000111010001000001010001001111011001001001010101011 efa6a4eb8081eba384efa7b2eb93a4eba09fe6b7abeb9485eba386efa684eb8286ec989ae5bc9beabea3ed8ba6efa78aeab3b7ecb88fefa688ebaa83ec9488e88289ec92ab
UHC 捻뀁룄鱗들렟淫딅룆濾낆옚弛꾣틦流곷츏麗몃씈肉쒫 11100110111101111011001011101100100011111000010011101100111001111011010111101001100011101011000011101011111000101000101011101011100011111000010111100110101001001000010111101100100111101001111011101100101011001000010011100110101110101001000011101010111111001000000111101011101011101000101011100110101100001011100011101011100111011010000011101011101111111001110101000010 e6f7b2ec8f84ece7b5e98eb0ebe28aeb8f85e6a485ec9e9eecac84e6ba90eafc81ebae8ae6b0b8eb9da0ebbf9d42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
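
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The helper name `encode_table` and the mapping from the table's charset labels to Python codec names are assumptions: in particular, `cp932` is assumed for SJIS-WIN and `cp949` for UHC, and Python's codecs may differ from the converter's in a few edge-case mappings. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Windows Shift_JIS variant
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Unified Hangul Code
}


def encode_table(text):
    """Return (charset label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("肉"):
        print(label, bits, hex_string)
```

For the input 肉, this sketch should give `93f7` for SJIS-WIN, `c6f9` for EUC-JP, and `e88289` for UTF-8, the same byte sequences that appear for that character in the corresponding rows of the table.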