To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??幽??娃??溢g?儀??暗??? 0011111100111111001111111110001010000110001111110011111110010111010010000011111100111111100010001010000100111111001111111000100011101100100000101000011100111111100010110101011000111111001111111000100011000011001111110011111100111111 3f3f3fe2863f3f97483f3f88a13f3f88ec82873f8b563f3f88c33f3f3f
EUC-JP ???竊??幽??娃??溢g?儀??暗??彛 00111111001111110011111111100011111001100011111100111111110011011010100100111111001111111011000010100011001111110011111110110000111011101010001111100111001111111011010110110111001111110011111110110000110001010011111100111111100011111011110011111010 3f3f3fe3e63f3fcda93f3fb0a33f3fb0eea3e73fb5b73f3fb0c53f3f8fbcfa
UTF-8 捻뀁뮆竊섉꼷幽꾩낄娃븐슦溢g솾儀뺧폍暗싲㉡彛 111011111010011010100100111010111000000010000001111010111010111010000110111001111010101110001010111011001000010010001001111010101011110010110111111001011011100110111101111010101011111010101001111010111000001010000100111001011010100010000011111010111011100010010000111011001000101010100110111001101011101010100010111011111011110110000111111011001000011010111110111001011000010010000000111010111011101010100111111011011000111110001101111001101001101010010111111011001000101110110010111000111000100110100001111001011011110110011011 efa6a4eb8081ebae86e7ab8aec8489eabcb7e5b9bdeabea9eb8284e5a883ebb890ec8aa6e6baa2efbd87ec86bee58480ebbaa7ed8f8de69a97ec8bb2e389a1e5bd9b
UHC 捻뀁뮆竊섉꼷幽꾩낄娃븐슦溢g솾儀뺧폍暗싲㉡彛 1110011011110111101100101110110010010010100101011110111110111100100110001110011010000100100011111110101011101011100001001110110010110011101001011110100011011111101110101110110010011010101100001110110011101110101000111110011110011001101100101110101111110000100101011110111110111100100110001110010011011110100110101110101110101000101100101110110010101101 e6f7b2ec9295efbc98e6848feaeb84ecb3a5e8dfbaec9ab0eceea3e799b2ebf095efbc98e4de9aeba8b2ecad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)