What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 蒻る????釉??[蒻る????釉??[^ 111001001110100010000010111010010011111100111111001111110011111111100111110101100011111100111111010110111110010011101000100000101110100100111111001111110011111100111111111001111101011000111111001111110101101101011110 e4e882e93f3f3f3fe7d63f3f5be4e882e93f3f3f3fe7d63f3f5b5e
EUC-JP 蒻る?彛??釉??[蒻る?彛??釉??[^ 11101000111010101010010011101011001111111000111110111100111110100011111100111111111011101101100000111111001111110101101111101000111010101010010011101011001111111000111110111100111110100011111100111111111011101101100000111111001111110101101101011110 e8eaa4eb3f8fbcfa3f3feed83f3f5be8eaa4eb3f8fbcfa3f3feed83f3f5b5e
UTF-8 蒻る툦彛끻땔釉앹춦[蒻る툦彛끻땔釉앹춦[^ 111010001001001010111011111000111000001010001011111011011000100010100110111001011011110110011011111010111000000110111011111010111001010110010100111010011000011110001001111011001001010110111001111011001011011010100110010110111110100010010010101110111110001110000010100010111110110110001000101001101110010110111101100110111110101110000001101110111110101110010101100101001110100110000111100010011110110010010101101110011110110010110110101001100101101101011110 e892bbe3828bed88a6e5bd9beb81bbeb9594e98789ec95b9ecb6a65be892bbe3828bed88a6e5bd9beb81bbeb9594e98789ec95b9ecb6a65b5e
UHC 蒻る툦彛끻땔釉앹춦[蒻る툦彛끻땔釉앹춦[^ 111001011011011010101010111010111011100010011101111011001010110110000101111001011011011010101010111010111011100010011101111011001010110110000101010110111110010110110110101010101110101110111000100111011110110010101101100001011110010110110110101010101110101110111000100111011110110010101101100001010101101101011110 e5b6aaebb89decad85e5b6aaebb89decad855be5b6aaebb89decad85e5b6aaebb89decad855b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
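
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the tool's own implementation. The codec names below are Python's, and their mapping to the table's labels is an assumption (in particular `cp932` for SJIS-Win and `cp949` for UHC). The helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    # "[^" is the ASCII tail of the sample input in the table above.
    for name, (bits, hexstr) in encode_table("[^").items():
        print(name, bits, hexstr)
```

Because `[` and `^` are ASCII, every charset listed here encodes them as the same two bytes, `5b 5e`, which is why each row of the table ends in `5b5e`.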