To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???猷??筍ろ?檍??猷??遺??吾?Ⅲ 0011111100111111001111111001011101010001001111110011111111100010101000011000001011101011001111111001111011111000001111110011111110010111010100010011111100111111100010001110001000111111001111111000110011100001001111111000011101010110 3f3f3f97513f3fe2a182eb3f9ef83f3f97513f3f88e23f3f8ce13f8756
EUC-JP ???猷??筍ろ?檍??猷??遺??吾?? 00111111001111110011111111001101101100100011111100111111111001001010001110100100111011010011111111011100111110100011111100111111110011011011001000111111001111111011000011100100001111110011111110111000111000110011111100111111 3f3f3fcdb23f3fe4a3a4ed3fdcfa3f3fcdb23f3fb0e43f3fb8e33f3f
UTF-8 捻뀀툙猷녻굢筍ろ떐檍됱럥猷뤷쫩遺듬짃吾몄Ⅲ 111011111010011010100100111010111000000010000000111011011000100010011001111001111000110010110111111010111000010110111011111010101011010110100010111001111010110110001101111000111000001010001101111010111001011010010000111001101010101010001101111010111001000010110001111010111001111110100101111001111000110010110111111010111010010010110111111011001010101110101001111010011000000110111010111010111001001110101100111011001010011110000011111001011001000010111110111010111010101010000100111000101000010110100010 efa6a4eb8080ed8899e78cb7eb85bbeab5a2e7ad8de3828deb9690e6aa8deb90b1eb9fa5e78cb7eba4b7ecaba9e981baeb93aceca783e590beebaa84e285a2
UHC 捻뀀툙猷녻굢筍ろ떐檍됱럥猷뤷쫩遺듬짃吾몄Ⅲ 111001101111011110110010111010111011100010010000111010111010001110000110111010001000001010001001111000101110110010101010111011011000101110100110111001011110010110001001111011001000111010001000111010111010001110001111111001011010011010000010111010111011011010110101111010111010001110010011111001111110111010111000111011001010010110110010 e6f7b2ebb890eba386e88289e2ecaaed8ba6e5e589ec8e88eba38fe5a682ebb6b5eba393e7eeb8eca5b2

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
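
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The mapping from the page's labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is consistent with the runs of `3f` in the table above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

Running the script prints one row per charset in the same order as the table: label, binary bit string, hexadecimal string.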