To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????}v??????????}vB 00111111001111110011111100111111001111110011111100111111001111110011111100111111011111010111011000111111001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 長?再??牆?牆橋悼}v長?再??牆?牆橋悼}vB 10010010101101110011111110001101110001000011111100111111111000001010110100111111111000001010110110001011101101001001001110001001011111010111011010010010101101110011111110001101110001000011111100111111111000001010110100111111111000001010110110001011101101001001001110001001011111010111011001000010 92b73f8dc43f3fe0ad3fe0ad8bb493897d7692b73f8dc43f3fe0ad3fe0ad8bb493897d7642
EUC-JP 長?再??牆?牆橋悼}v長?再??牆?牆橋悼}vB 11000100101110010011111110111010110001100011111100111111111000001010111100111111111000001010111110110110101101101100010111101001011111010111011011000100101110010011111110111010110001100011111100111111111000001010111100111111111000001010111110110110101101101100010111101001011111010111011001000010 c4b93fbac63f3fe0af3fe0afb6b6c5e97d76c4b93fbac63f3fe0af3fe0afb6b6c5e97d7642
UTF-8 長렮再쾡벵牆렓牆橋悼}v長렮再쾡벵牆렓牆橋悼}vB 1110100110010101101101111110101110100000101011101110010110000110100011011110110010111110101000011110101110110010101101011110011110001001100001101110101110100000100100111110011110001001100001101110011010101001100010111110011010000010101111000111110101110110111010011001010110110111111010111010000010101110111001011000011010001101111011001011111010100001111010111011001010110101111001111000100110000110111010111010000010010011111001111000100110000110111001101010100110001011111001101000001010111100011111010111011001000010 e995b7eba0aee5868decbea1ebb2b5e78986eba093e78986e6a98be682bc7d76e995b7eba0aee5868decbea1ebb2b5e78986eba093e78986e6a98be682bc7d7642
UHC 長렮再쾡벵牆렓牆橋悼}v長렮再쾡벵牆렓牆橋悼}vB 111011011111111010001110101110111110111010100010110001001110100110111010101011001110110111101101100011101010100011101101111011011100111011101001110100111111101001111101011101101110110111111110100011101011101111101110101000101100010011101001101110101010110011101101111011011000111010101000111011011110110111001110111010011101001111111010011111010111011001000010 edfe8ebbeea2c4e9baaceded8ea8ededcee9d3fa7d76edfe8ebbeea2c4e9baaceded8ea8ededcee9d3fa7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)