To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????~??????????~?????B 0011111100111111001111110011111100111111011111100011111100111111001111110011111100111111001111110011111100111111001111110011111101111110001111110011111100111111001111110011111101000010 3f3f3f3f3f7e3f3f3f3f3f3f3f3f3f3f7e3f3f3f3f3f42
SJIS-WIN ?丹?ぱ∫~??悠ビ??丹?ぱ∫~??悠ビ?B 001111111001001001001111001111111000001011001111100000011110011101111110001111110011111110010111010010011000001101110010001111110011111110010010010011110011111110000010110011111000000111100111011111100011111100111111100101110100100110000011011100100011111101000010 3f924f3f82cf81e77e3f3f974983723f3f924f3f82cf81e77e3f3f974983723f42
EUC-JP ?丹?ぱ∫~??悠ビ??丹?ぱ∫~??悠ビ?B 001111111100001110110000001111111010010011010001101000101110100101111110001111110011111111001101101010101010010111010011001111110011111111000011101100000011111110100100110100011010001011101001011111100011111100111111110011011010101010100101110100110011111101000010 3fc3b03fa4d1a2e97e3f3fcdaaa5d33f3fc3b03fa4d1a2e97e3f3fcdaaa5d33f42
UTF-8 룶丹춳ぱ∫~룵에悠ビ∼룶丹춳ぱ∫~룵에悠ビ∼B 111010111010001110110110111001001011100010111001111011001011011010110011111000111000000110110001111000101000100010101011011111101110101110100011101101011110110010010111100100001110011010000010101000001110001110000011100100111110001010001000101111001110101110100011101101101110010010111000101110011110110010110110101100111110001110000001101100011110001010001000101010110111111011101011101000111011010111101100100101111001000011100110100000101010000011100011100000111001001111100010100010001011110001000010 eba3b6e4b8b9ecb6b3e381b1e288ab7eeba3b5ec9790e682a0e38393e288bceba3b6e4b8b9ecb6b3e381b1e288ab7eeba3b5ec9790e682a0e38393e288bc42
UHC 룶丹춳ぱ∫~룵에悠ビ∼룶丹춳ぱ∫~룵에悠ビ∼B 10001111101010111101001110100001101011011000111110101010110100011010000111110010011111101000111110101010101111111010000111101010111011011010101111010011101000011010110110001111101010111101001110100001101011011000111110101010110100011010000111110010011111101000111110101010101111111010000111101010111011011010101111010011101000011010110101000010 8fabd3a1ad8faad1a1f27e8faabfa1eaedabd3a1ad8fabd3a1ad8faad1a1f27e8faabfa1eaedabd3a1ad42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)