To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???艶??閻わ?箋??閻ょ????絶?『^ 001111110011111100111111100010011001000000111111001111111110100010000101100000101110110100111111111000101011001100111111001111111110100010000101100000101110010100111111001111110011111100111111100100001110001000111111100000010111011101011110 3f3f3f89903f3fe88582ed3fe2b33f3fe88582e53f3f3f3f90e23f81775e
EUC-JP ??˚艶??閻わ?箋??閻ょ???˚絶?『^ 00111111001111111000111110100010101101101011000111110000001111110011111111101111111001011010010011101111001111111110010010110101001111110011111111101111111001011010010011100111001111110011111100111111100011111010001010110110110000001110010000111111101000011101100001011110 3f3f8fa2b6b1f03f3fefe5a4ef3fe4b53f3fefe5a4e73f3f3f8fa2b6c0e43fa1d85e
UTF-8 劣쀧˚艶쇘뜔閻わ풜箋섊뜔閻ょ읆劣쀧˚絶잏『^ 1110111110100110100111011110110010000000101001111100101110011010111010001000100110110110111011001000011110011000111010111001110010010100111010011001011010111011111000111000001010001111111011011001001010011100111001111010111010001011111011001000010010001010111010111001110010010100111010011001011010111011111000111000001010000111111011001001110110000110111011111010011010011101111011001000000010100111110010111001101011100111101101011011011011101100100111101000111111100011100000001000111001011110 efa69dec80a7cb9ae889b6ec8798eb9c94e996bbe3828fed929ce7ae8bec848aeb9c94e996bbe38287ec9d86efa69dec80a7cb9ae7b5b6ec9e8fe3808e5e
UHC 劣쀧˚艶쇘뜔閻わ풜箋섊뜔閻ょ읆劣쀧˚絶잏『^ 11100110111010111001011111100111101000101010101011100110111111011011110011100111100011011001011111100111101000101010101011101111101111101001111111101111101010001001100011100111100011011001011111100111101000101010101011100111100111111011110011100110111010111001011111100111101000101010101011101111101111101001111111100111101000011011101001011110 e6eb97e7a2aae6fdbce78d97e7a2aaefbe9fefa898e78d97e7a2aae79fbce6eb97e7a2aaefbe9fe7a1ba5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
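
The conversion in the table above can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) and the helper name `to_bit_strings` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be how the ISO-8859-1 row above was produced.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Print one row per charset for a short sample string.
for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings("『^", codec)
    print(label, binary, hexadecimal)
```

For the sample string `『^`, the hexadecimal column ends in `5e` for every charset, since `^` is plain ASCII and encodes to the same single byte in all five. Results for other characters may differ from this page wherever its converter and Python's codecs disagree on a mapping.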