What bit string is a character (or characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???援??邑??筌??油??淞る?藥?? 00111111001111110011111110001001100001110011111100111111100101110101011100111111001111111110001010100011001111110011111110010110111110110011111100111111100111111100001010000010111010010011111111100101010110100011111100111111 3f3f3f89873f3f97573f3fe2a33f3f96fb3f3f9fc282e93fe55a3f3f
EUC-JP ???援??邑??筌??油??淞る?藥?? 00111111001111110011111110110001111001110011111100111111110011011011100000111111001111111110010010100101001111110011111111001100111111010011111100111111110111101100010010100100111010110011111111101001101110110011111100111111 3f3f3fb1e73f3fcdb83f3fe4a53f3fccfd3f3fdec4a4eb3fe9bb3f3f
UTF-8 捻뀁슜援앶솻邑㏃뒙筌먥돦油뤹뮫淞る닰藥뀁덩 111011111010011010100100111010111000000010000001111011001000101010011100111001101000111110110100111011001001010110110110111011001000011010111011111010011000001010010001111000111000111110000011111010111001001010011001111001111010110110001100111010111010100010100101111010111000111110100110111001101011001010111001111010111010010010111001111010111010111010101011111001101011011110011110111000111000001010001011111010111000101110110000111010001001011110100101111010111000000010000001111010111000110110101001 efa6a4eb8081ec8a9ce68fb4ec95b6ec86bbe98291e38f83eb9299e7ad8ceba8a5eb8fa6e6b2b9eba4b9ebaeabe6b79ee3828beb8bb0e897a5eb8081eb8da9
UHC 捻뀁슜援앶솻邑㏃뒙筌먥돦油뤹뮫淞る닰藥뀁덩 111001101111011110110010111011001001101010101001111010101011010110011101111010011001100110110000111010111110100110100111111011001000101010010110111011111010011110010000111000101000100110101010111010101111101010001111111001111001001010110101111000011110011110101010111010111000100010100110111001011011011110110010111011001011010110100010 e6f7b2ec9aa9eab59de999b0ebe9a7ec8a96efa790e289aaeafa8fe792b5e1e7aaeb88a6e5b7b2ecb5a2

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
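
The conversion shown in the table can be approximated offline with a short script. The following is a minimal Python sketch, not the code behind this page. The mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper name `encode_report` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table above does as well.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return (label, binary string, hex string) for each charset.

    encode_report is a hypothetical helper name, not part of any library.
    """
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


# Example: print one row per charset for a single character.
for label, bits, hex_string in encode_report("援"):
    print(label, bits, hex_string)
```

For the character 援, this sketch should produce `e68fb4` for UTF-8, `8987` for SJIS-WIN, and `b1e7` for EUC-JP, the same byte sequences that appear for that character in the table rows above.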