To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????T?????????TB 001111110011111100111111001111110011111100111111001111110011111100111111010101000011111100111111001111110011111100111111001111110011111100111111001111110101010001000010 3f3f3f3f3f3f3f3f3f543f3f3f3f3f3f3f3f3f5442
SJIS-WIN 橈??衣??怨??T橈??衣??怨??TB 100111101111010000111111001111111000100011011111001111110011111110001001100001010011111100111111010101001001111011110100001111110011111110001000110111110011111100111111100010011000010100111111001111110101010001000010 9ef43f3f88df3f3f89853f3f549ef43f3f88df3f3f89853f3f5442
EUC-JP 橈??衣??怨??T橈??衣??怨??TB 110111001111011000111111001111111011000011100001001111110011111110110001111001010011111100111111010101001101110011110110001111110011111110110000111000010011111100111111101100011110010100111111001111110101010001000010 dcf63f3fb0e13f3fb1e53f3f54dcf63f3fb0e13f3fb1e53f3f5442
UTF-8 橈볥돁衣쏙쭓怨뺤젞T橈볥돁衣쏙쭓怨뺤젞TB 111001101010100110001000111010111011001110100101111010111000111110000001111010001010000110100011111011001000111110011001111011001010110110010011111001101000000010101000111010111011101010100100111011001010000010011110010101001110011010101001100010001110101110110011101001011110101110001111100000011110100010100001101000111110110010001111100110011110110010101101100100111110011010000000101010001110101110111010101001001110110010100000100111100101010001000010 e6a988ebb3a5eb8f81e8a1a3ec8f99ecad93e680a8ebbaa4eca09e54e6a988ebb3a5eb8f81e8a1a3ec8f99ecad93e680a8ebbaa4eca09e5442
UHC 橈볥돁衣쏙쭓怨뺤젞T橈볥돁衣쏙쭓怨뺤젞TB 111010001111101010010011111010111000100110010100111010111111110110111101111011111010011110001011111010101011001110010101111011001010000010011000010101001110100011111010100100111110101110001001100101001110101111111101101111011110111110100111100010111110101010110011100101011110110010100000100110000101010001000010 e8fa93eb8994ebfdbdefa78beab395eca09854e8fa93eb8994ebfdbdefa78beab395eca0985442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)