To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 弱??厓ц????弱??弱??厓ц?節ワ?弱??B 1000111011100011001111110011111111111010100011011000010010001000001111110011111100111111001111111000111011100011001111110011111110001110111000110011111100111111111110101000110110000100100010000011111110010000110111111000001110001111001111111000111011100011001111110011111101000010 8ee33f3ffa8d84883f3f3f3f8ee33f3f8ee33f3ffa8d84883f90df838f3f8ee33f3f42
EUC-JP 弱??厓ц????弱??弱??厓ц?節ワ?弱??B 10111100111001010011111100111111100011111011010011000111101001111110100000111111001111110011111100111111101111001110010100111111001111111011110011100101001111110011111110001111101101001100011110100111111010000011111111000000111000011010010111101111001111111011110011100101001111110011111101000010 bce53f3f8fb4c7a7e83f3f3f3fbce53f3fbce53f3f8fb4c7a7e83fc0e1a5ef3fbce53f3f42
UTF-8 弱놅숱厓ц춶溫뽳숴弱놅쉠弱놅숱厓ц춷節ワ숴弱놅쉠B 1110010110111100101100011110101110000110100001011110110010001000101100011110010110001110100100111101000110000110111011001011011010110110111001101011101010101011111010111011110110110011111011001000100010110100111001011011110010110001111010111000011010000101111011001000100110100000111001011011110010110001111010111000011010000101111011001000100010110001111001011000111010010011110100011000011011101100101101101011011111100111101011111000000011100011100000111010111111101100100010001011010011100101101111001011000111101011100001101000010111101100100010011010000001000010 e5bcb1eb8685ec88b1e58e93d186ecb6b6e6baabebbdb3ec88b4e5bcb1eb8685ec89a0e5bcb1eb8685ec88b1e58e93d186ecb6b7e7af80e383afec88b4e5bcb1eb8685ec89a042
UHC 弱놅숱厓ц춶溫뽳숴弱놅쉠弱놅숱厓ц춷節ワ숴弱놅쉠B 11100101101100001000011011101111101111011010001011100100111011011010110011101000101011011001001011101000101011101001011011101111101111011010010011100101101100001000011011101111101111011010101011100101101100001000011011101111101111011010001011100100111011011010110011101000101011011001001111101111101111011010101111101111101111011010010011100101101100001000011011101111101111011010101001000010 e5b086efbda2e4edace8ad92e8ae96efbda4e5b086efbdaae5b086efbda2e4edace8ad93efbdabefbda4e5b086efbdaa42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)